Explanations
references to specific healthcare facilities or institutions
Followed by numbers or names
names of people and places
Negative Logits
Token     Logit
[…]       -1.28
…         -0.99
(blank)   -0.83
...       -0.82
[...]     -0.75
<eos>     -0.74
[…]       -0.74
.         -0.73
…         -0.72
(blank)   -0.71
Positive Logits
Token             Logit
Савезне           1.81
Personensuche     1.46
tagHelperRunner   1.41
pleaſure          1.37
Мексичка          1.32
:✨                1.32
Majefty           1.31
Roskov            1.31
purpoſe           1.28
ſelves            1.28
Activations Density: 0.019%