INDEX
Explanations
phrases that refer to prior instances or comparisons to earlier states or data
references to prior events or periods in time
New Auto-Interp
Negative Logits
lee
-0.84
ILCS
-0.76
hots
-0.74
kamp
-0.73
rage
-0.73
aliation
-0.73
lua
-0.71
ocracy
-0.69
rosso
-0.68
umes
-0.67
POSITIVE LOGITS
generations
1.08
incarnation
1.04
occupant
0.99
incarn
0.91
ebin
0.90
batch
0.89
generation
0.88
iteration
0.84
editions
0.80
vernment
0.79
Activations Density 0.023%