INDEX
Explanations
historical and religious figures' names
historical figures and events
New Auto-Interp
Negative Logits
tools
-0.72
malink
-0.70
ratom
-0.70
feature
-0.70
rador
-0.70
Wisconsin
-0.68
machine
-0.68
REAM
-0.67
FU
-0.67
LOAD
-0.67
POSITIVE LOGITS
Galile
1.22
XVI
1.20
Augustus
1.19
Herod
1.18
VIII
1.16
Tud
1.15
Claud
1.13
Romans
1.13
XIV
1.12
XII
1.12
Activations Density 0.244%