INDEX
Explanations
references to forms, organization, and structure in procedural or technical contexts
New Auto-Interp
Negative Logits
ubre
-0.17
istrovstvÃŃ
-0.15
icht
-0.15
piger
-0.15
Barrier
-0.15
opia
-0.14
adesh
-0.14
endir
-0.14
istrat
-0.13
Ñģл
-0.13
POSITIVE LOGITS
filled
0.24
Filled
0.20
contents
0.19
contents
0.17
occupied
0.17
filled
0.17
/column
0.17
empty
0.17
inh
0.16
pler
0.15
Activations Density 0.273%