INDEX
Explanations
instances of the word "vor" in various inflections and contexts
Negative Logits
Token      Logit
McMahon    -0.16
cms        -0.16
baugh      -0.15
sdale      -0.14
Rum        -0.14
ë¦         -0.14
ADE        -0.14
Obs        -0.14
arası      -0.14
dao        -0.14
Positive Logits
Token      Logit
allem      0.26
Ort        0.24
rang       0.23
arl        0.22
her        0.19
acious     0.17
beh        0.17
ause       0.17
rats       0.17
zeitig     0.17
Activations Density: 0.005%