INDEX
Explanations
mentions of specific locations or organizations, particularly related to serious events or history
mentions of specific geographical locations
New Auto-Interp
Negative Logits
Entity
-0.73
VALUE
-0.70
TPPStreamerBot
-0.67
IPS
-0.67
PATH
-0.67
Ent
-0.66
Tokens
-0.66
NASCAR
-0.65
ECT
-0.65
INT
-0.65
POSITIVE LOGITS
reb
1.35
acus
0.87
unal
0.86
vernment
0.84
oli
0.83
aternity
0.82
undy
0.81
axter
0.77
ruary
0.77
iotics
0.75
Activations Density 0.020%