INDEX
Explanations
references to historical events or figures
references to historical figures and their contributions
New Auto-Interp
Negative Logits
receivers
-0.69
surrender
-0.69
sums
-0.66
vow
-0.65
plat
-0.64
infringing
-0.62
sender
-0.62
indictment
-0.61
imprisonment
-0.61
Proposition
-0.61
POSITIVE LOGITS
orically
1.67
ically
1.61
orical
1.57
oric
1.50
icist
1.50
ICAL
1.47
ician
1.45
icians
1.44
orians
1.38
ical
1.36
Activations Density 0.058%