Explanations
words related to physical injuries or pain
words related to various types of food and their preparation methods
Negative Logits
Token        Logit
acqu         -0.78
Samar        -0.63
iage         -0.62
rave         -0.58
Aur          -0.58
ku           -0.57
raise        -0.56
recomm       -0.56
resolution   -0.54
iazep        -0.54
Positive Logits
Token        Logit
xus          0.82
atism        0.79
Sport        0.67
othy         0.65
acus         0.65
ulent        0.63
ulence       0.63
Obj          0.62
å°Ĩ          0.62
ombat        0.62
Activations Density: 0.059%