INDEX
Explanations
names of specific locations or institutions
words associated with extreme events or conditions
New Auto-Interp
Negative Logits
Vaugh
-0.58
prest
-0.58
referen
-0.56
corrid
-0.54
thous
-0.53
sic
-0.52
_.
-0.52
destro
-0.52
disadvant
-0.51
challeng
-0.51
POSITIVE LOGITS
Kingdoms
0.59
Rewards
0.53
GOODMAN
0.51
Profile
0.51
Hedge
0.51
Streamer
0.51
Seasons
0.50
âĢº
0.50
Scene
0.49
Shin
0.49
Activations Density 1.753%