Skip to content

Commit b2379fd

Browse files
[Perf]: adjust apply_rotary api
Signed-off-by: MagnetoWang <[email protected]> Co-authored-by: cynthieye <[email protected]>
1 parent e630289 commit b2379fd

File tree

1 file changed

+1
-2
lines changed

1 file changed

+1
-2
lines changed

vllm/model_executor/layers/rotary_embedding.py

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -34,8 +34,7 @@
3434
from vllm.platforms import current_platform
3535

3636
if is_flash_attn_2_available():
37-
from flash_attn.ops.triton.rotary import apply_rotary
38-
37+
from flash_attn.layers.rotary import apply_rotary
3938

4039
def _rotate_neox(x: torch.Tensor) -> torch.Tensor:
4140
x1 = x[..., :x.shape[-1] // 2]

0 commit comments

Comments
 (0)