enabling Intel(R) Extension for PyTorch* #16

Open: wants to merge 4 commits into base: main


kta-intel

This PR integrates Intel Extension for PyTorch (IPEX) into TGIS so that users can apply its transformers optimizations to maximize performance on Intel CPUs.

The changes are to the Dockerfile and add an IPEX deployment framework that applies the optimizations.

Signed-off-by: kta-intel <[email protected]>
@kta-intel kta-intel marked this pull request as draft November 30, 2023 14:04
@kta-intel kta-intel marked this pull request as ready for review January 17, 2024 16:27
kta-intel (Author)

Note: this PR comments out the nightly build because the IPEX version must align with the torch version (i.e., stable torch 2.1 with stable IPEX 2.1).
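
To illustrate the alignment constraint in the note above, here is a minimal sketch of a major.minor version check. It is not part of this PR: the helper names `major_minor` and `versions_aligned` are hypothetical, and the version strings in the usage note are only examples of the usual `X.Y.Z+suffix` shape. In a real image the two strings would presumably come from `torch.__version__` and `intel_extension_for_pytorch.__version__`.

```python
import re
from typing import Tuple


def major_minor(version: str) -> Tuple[int, int]:
    """Extract (major, minor) from a version string such as '2.1.0+cpu'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def versions_aligned(torch_version: str, ipex_version: str) -> bool:
    """Return True when both packages share the same major.minor release."""
    return major_minor(torch_version) == major_minor(ipex_version)
```

For example, `versions_aligned("2.1.0+cpu", "2.1.100+cpu")` is true, while a nightly torch such as `"2.2.0.dev20231130+cpu"` paired with IPEX `"2.1.0+cpu"` is not aligned, which is the situation the note describes.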

JRosenkranz pushed a commit to JRosenkranz/text-generation-inference-server that referenced this pull request Jul 10, 2024
… padding (IBM#16)

AFAIK there is no torch device type called "gpu".

1 participant