INDEX
Explanations
mentions of past events or historical contexts
references to nostalgic or past experiences
New Auto-Interp
Negative Logits
IDA
-0.79
2018
-0.76
wake
-0.75
oday
-0.69
2018
-0.68
2015
-0.68
2017
-0.67
>>>
-0.67
Retrieved
-0.66
expire
-0.65
POSITIVE LOGITS
unthinkable
0.85
unimaginable
0.75
typew
0.72
taboo
0.65
RPGs
0.65
feared
0.65
computers
0.64
primitive
0.64
invincible
0.63
fashionable
0.63
Activations Density 0.413%