INDEX
Explanations
government-related terms
New Auto-Interp
Negative Logits
potion
-0.71
CHAT
-0.68
morning
-0.66
Recommend
-0.64
selection
-0.64
Kara
-0.63
pier
-0.62
Factor
-0.61
Channel
-0.61
Bay
-0.61
POSITIVE LOGITS
hips
1.15
chool
1.06
hip
1.01
agascar
0.87
mith
0.80
ystem
0.80
governments
0.80
collide
0.80
empires
0.78
alike
0.78
Activations Density 0.184%