INDEX
Explanations
phrase patterns involving the preposition "of."
New Auto-Interp
Negative Logits
odal
-0.15
igi
-0.15
κοÏĤ
-0.14
angan
-0.14
hyp
-0.14
rust
-0.14
ohon
-0.14
aft
-0.13
ays
-0.13
ETF
-0.13
POSITIVE LOGITS
rts
0.15
drained
0.15
gent
0.14
uso
0.14
orus
0.14
Gand
0.14
bern
0.14
Strand
0.14
906
0.14
bert
0.13
Activations Density 0.108%