INDEX
Explanations
specific entity names or terms, potentially related to investigative or legal contexts
references to people, particularly in the context of relationships or events involving them
New Auto-Interp
Negative Logits
yz
-0.71
PN
-0.66
Compare
-0.65
GMT
-0.63
SIGN
-0.63
Pwr
-0.61
Presidents
-0.61
Anonymous
-0.60
GBT
-0.60
TRUMP
-0.59
POSITIVE LOGITS
grandson
0.86
wered
0.85
own
0.85
alty
0.82
ullivan
0.82
terday
0.80
granddaughter
0.79
anton
0.77
son
0.77
footsteps
0.77
Activations Density 0.143%