INDEX
Explanations
phrases related to communication and interpersonal interactions
New Auto-Interp
Negative Logits
tram
-0.70
mercial
-0.68
commercial
-0.65
psychiat
-0.64
board
-0.64
subsid
-0.62
range
-0.62
vulner
-0.61
paved
-0.61
aerial
-0.61
POSITIVE LOGITS
¹
1.38
ľ
1.33
ª
1.33
¡
1.27
Ķ
1.24
´
1.22
©
1.21
Ŀ
1.20
IJ
1.18
¦
1.18
Activations Density 0.126%