INDEX
Explanations
references to alternatives or options in various contexts
alternatives to
New Auto-Interp
Negative Logits
-0.56
-0.45
,
-0.44
"
-0.42
'
-0.41
(
-0.40
-0.39
“
-0.39
Pog
-0.39
↵
-0.38
POSITIVE LOGITS
alternatives
1.61
Alternatives
1.59
alternatives
1.57
Alternatives
1.52
alternativas
1.13
فريبيس
0.91
'\\;'
0.89
substitutes
0.88
autorytatywna
0.84
0.83
Activations Density 0.010%