INDEX
Explanations
references to historical figures and philosophical concepts
historical figures and events
New Auto-Interp
Negative Logits
ThroughAttribute
-0.67
informée
-0.63
iglesia
-0.62
كورة
-0.59
***!
-0.57
oa̍t
-0.57
iſten
-0.55
виправивши
-0.54
enablog
-0.53
XtraBars
-0.53
POSITIVE LOGITS
historical
0.50
Historical
0.40
Historical
0.40
historic
0.39
history
0.37
antique
0.36
Historic
0.35
historian
0.35
legado
0.33
Pos
0.33
Activations Density 0.058%