INDEX
Explanations
overall structure or layout elements in the text
Negative Logits
[]:          -0.50
geschlossen  -0.50
oom          -0.48
FOOTNOTES    -0.47
Tembelea     -0.46
espec        -0.46
]").         -0.45
tight        -0.45
タル         -0.45
dope         -0.44
Positive Logits
prefix       1.22
prefix       1.19
PREFIX       1.10
Prefix       1.04
prefixes     1.00
Prefix       0.99
PREFIX       0.96
prefixes     0.81
startswith   0.80
насељу       0.77
Activations Density 0.352%