INDEX
Explanations
locations or destinations
mention of countries and geographic locations
New Auto-Interp
Negative Logits
NUM
-0.59
arithmetic
-0.59
replay
-0.58
severity
-0.58
NUM
-0.58
summary
-0.57
exponent
-0.56
Participation
-0.56
resemblance
-0.56
importance
-0.55
POSITIVE LOGITS
shores
0.90
via
0.87
destinations
0.84
airst
0.73
undet
0.72
illegally
0.72
beck
0.71
docks
0.71
fray
0.71
agar
0.67
Activations Density 0.266%