INDEX
Explanations
phrases or terms referring to previous instances or historical context
references to prior content or events
New Auto-Interp
Negative Logits
ILCS
-0.81
bucks
-0.75
Franch
-0.74
Himself
-0.72
abad
-0.72
pter
-0.71
atown
-0.70
alion
-0.70
icles
-0.70
csv
-0.68
POSITIVE LOGITS
generations
1.26
incarnation
1.19
incarn
1.12
iterations
1.12
ebin
1.11
administrations
1.10
editions
1.08
eras
1.07
installments
1.06
iteration
1.02
Activations Density 0.059%