INDEX
Explanations
mentions of the word "both."
New Auto-Interp
Negative Logits
Ver
-0.54
favorite
-0.53
Ver
-0.51
ver
-0.50
mer
-0.49
seb
-0.49
TextAppearance
-0.48
wanna
-0.48
pas
-0.47
sos
-0.47
POSITIVE LOGITS
both
1.69
både
1.50
both
1.50
zowel
1.46
Both
1.38
Both
1.31
sowohl
1.27
zarówno
1.20
tanto
1.18
BOTH
1.17
Activations Density 0.146%