Explanations
educational institutions and related activities
Negative Logits
Token        Logit
potion       -0.79
CHAT         -0.73
tein         -0.71
otos         -0.69
Accessory    -0.63
ultimate     -0.63
bush         -0.60
proof        -0.58
rontal       -0.57
Timer        -0.57
Positive Logits
Token        Logit
hips         1.43
chool        1.17
hare         1.10
paces        1.08
alike        1.05
ystem        1.01
nationwide   0.99
folk         0.98
hops         0.97
vying        0.97
Activations Density: 1.739%