INDEX
Explanations
references to specific individuals or names
New Auto-Interp
Negative Logits
cription
-0.21
ifications
-0.20
ification
-0.20
ication
-0.20
cation
-0.20
ctions
-0.20
lications
-0.20
istrovstvÃŃ
-0.20
lectual
-0.20
icients
-0.20
POSITIVE LOGITS
adj
0.22
and
0.21
add
0.21
okay
0.19
angi
0.19
arkan
0.19
all
0.19
ohana
0.18
abbo
0.18
arr
0.18
Activations Density 0.607%