Explanations
expressions related to guidance and leadership efforts among youth
Negative Logits
iros    -0.19
astle   -0.16
voie    -0.15
راÙĩ    -0.14
ISON    -0.14
aus     -0.14
ritch   -0.14
utar    -0.14
cke     -0.14
iol     -0.14
Positive Logits
Shank    0.15
Tow      0.15
Tang     0.14
towards  0.14
toward   0.14
hearts   0.14
Zy       0.14
aggreg   0.14
oward    0.14
Towards  0.14
Activations Density: 0.436%