INDEX
Explanations
words related to body parts and injuries
New Auto-Interp
Negative Logits
odega
-0.17
isci
-0.16
avou
-0.14
áf
-0.14
intage
-0.14
illance
-0.14
throat
-0.14
obili
-0.14
rud
-0.14
orst
-0.14
POSITIVE LOGITS
/body
0.22
éĻħ
0.16
-region
0.16
-area
0.16
area
0.15
region
0.15
EXEMPLARY
0.15
eh
0.14
/head
0.14
á»iji
0.14
Activations Density 0.091%