Cannot get take_test.py to run due to TypeError: vecquant4matmul() #3


I'm running on Windows 10 in a conda environment. As far as I can tell, the right torch, CUDA, etc. modules are installed, and everything in requirements.txt installed without errors. I'm running the NOTA (None of the Above) trivia test on Alpaca Lora 7B (4bit):

python take_test.py --trivia fake_trivia_questions.json

The model and weights were downloaded from the Hugging Face repos and are in the correct directories. There is a warning before the first question about the tokenizer class being different, so the problem could be related to that. Either way, I get the following error:

Found 1 GPU(s) available.
Using device: cuda:0
Loading Model ...
The tokenizer class you load from this checkpoint is not the same type as the class this function is called from. It may result in unexpected tokenization.
The tokenizer class you load from this checkpoint is 'LLaMATokenizer'.
The class this function is called from is 'LlamaTokenizer'.
Loaded the model in 2.36 seconds.
Fitting 4bit scales and zeros to half
Question 1: What type of energy is used to power the Salkran Tower of Pertinax in the Carpathian Mountains of Romania?
    A. Solar
    B. Wind
    C. Gravitonic
    D. None of the above
    E. I don't know
Traceback (most recent call last):
  File "D:\devgit\haltt4llm\take_test.py", line 189, in <module>
    main()

....

TypeError: vecquant4matmul(): incompatible function arguments. The following argument types are supported:
    1. (arg0: torch.Tensor, arg1: torch.Tensor, arg2: torch.Tensor, arg3: torch.Tensor, arg4: torch.Tensor, arg5: int) -> None

Invoked with: tensor([[-0.0148, -0.0238,  0.0097,  ...,  0.0231, -0.0175,  0.0318]],
       device='cuda:0'), tensor([[ 2004248423,  2020046951,  1734903431,  ..., -2024113529,
         -1772648858,  1988708488],
        [ 2004318071,  1985447543,  1719101303,  ...,  1738958728,
          1734834296,  1988584549],
        [-2006481289, -2038991241,  2003200134,  ..., -1734780278,
         -2055714936, -1401572265],
        ...,
        [-2022213769, -2021226889,  1735947895,  ...,  2002357398,
          1483176039, -1215859063],
        [ 2005366614, -2022148249,  1752733576,  ...,   394557864,
          1986418055,  1483962710],
        [ 1735820935,  1988720743, -2056755593,  ..., -1468438152,
          1718123383,  1150911352]], device='cuda:0', dtype=torch.int32), tensor([[0., 0., 0.,  ..., 0., 0., 0.]], device='cuda:0'), tensor([[0.0318],
        [0.0154],
        [0.0123],
        ...,
        [0.0191],
        [0.0206],
        [0.0137]], device='cuda:0'), tensor([[0.2229],
        [0.1078],
        [0.0860],
        ...,
        [0.1528],
        [0.1439],
        [0.0959]], device='cuda:0')
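
Reading the message, the kernel's only supported signature takes six arguments (five tensors and an int), while the call above passed only five tensors. To compare that against what the installed extension actually exposes, here is a minimal sketch that prints a compiled function's docstring, which pybind11-built functions normally populate with their accepted signatures. The module name `quant_cuda` is my assumption (it is not shown in the trace above), and `describe_kernel` is a hypothetical helper, so adjust both to your build:

```python
import importlib


def describe_kernel(module_name, func_name):
    """Return the docstring of module_name.func_name, or None if unavailable."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        # Extension is not built or not on the import path.
        return None
    func = getattr(module, func_name, None)
    if func is None:
        # Module imported, but it has no attribute with that name.
        return None
    return func.__doc__


if __name__ == "__main__":
    # "quant_cuda" is an assumed module name, not confirmed by the traceback.
    print(describe_kernel("quant_cuda", "vecquant4matmul"))
```

If the printed signature lists a trailing `int` argument that the Python caller never passes, the compiled kernel and the calling code may come from different versions.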


Any ideas why?
