INDEX
Explanations
references to volumes and methodologies in academic contexts
Roman numerals or letters in parentheses
roman numerals and cause
New Auto-Interp
Negative Logits
StrictEqual
-0.51
ivelany
-0.47
ange
-0.45
naka
-0.45
coni
-0.43
CONCEP
-0.42
caufe
-0.42
cauſe
-0.41
mato
-0.41
ſch
-0.40
POSITIVE LOGITS
III
0.90
III
0.88
VIII
0.84
XXX
0.83
XII
0.82
IV
0.82
VIII
0.80
VII
0.80
VII
0.78
0.78
Activations Density 0.392%