INDEX
Explanations
phrases related to current events
mentions of events or occurrences
Negative Logits
ilts        -0.78
phies       -0.78
cot         -0.74
alty        -0.73
pose        -0.73
rete        -0.72
issue       -0.71
Parables    -0.71
oug         -0.71
ocl         -0.71
Positive Logits
uate        0.94
unfolding   0.75
unfold      0.74
uated       0.73
everywhere  0.73
uates       0.73
Archdemon   0.72
around      0.69
uating      0.68
havoc       0.68
Activations Density 0.032%