INDEX
Explanations
names or proper nouns
the last names of people mentioned in the document
Negative Logits
¥ŀ           -0.97
ĺħ           -0.86
ĸļ           -0.84
ŃĶ           -0.82
livest       -0.73
categ        -0.72
Surviv       -0.70
unanimous    -0.68
vegetarian   -0.66
spicy        -0.66
Positive Logits
imer         1.26
osal         1.02
ieth         0.88
oon          0.80
glass        0.79
olerance     0.78
ipl          0.78
olin         0.75
acht         0.74
IELD         0.74
Activations Density 0.013%