INDEX
Explanations
phrases referring to past states or previous conditions
references to time, specifically indicating past conditions or states
New Auto-Interp
Negative Logits
asso
-0.76
agent
-0.74
anders
-0.74
rics
-0.73
aby
-0.72
ighter
-0.71
vez
-0.70
anche
-0.70
ento
-0.70
atro
-0.69
POSITIVE LOGITS
Era
0.78
era
0.74
incarn
0.73
fashioned
0.72
thriving
0.69
forb
0.69
defunct
0.68
incarnation
0.68
eras
0.68
precedent
0.68
Activations Density 0.277%