INDEX
Explanations
phrases related to emotional relationships with food
punctuation and speech-related phrases
New Auto-Interp
Negative Logits
Wilde
-0.84
eren
-0.82
Olympus
-0.79
OE
-0.78
urtles
-0.77
erton
-0.77
Bott
-0.76
bott
-0.76
orio
-0.76
ois
-0.75
POSITIVE LOGITS
Kh
2.41
Kh
2.37
kh
2.08
Khan
1.96
kh
1.66
Gh
1.59
KH
1.56
Khal
1.49
Gh
1.44
Kazakh
1.33
Activations Density 0.195%