INDEX
Explanations
words that indicate capability or potential in various contexts
Negative Logits
Σε
-0.82
[^
-0.80
WindowConstants
-0.76
ifrance
-0.74
assium
-0.74
}^{[
-0.73
estinal
-0.68
ioutil
-0.68
ویکیپدیای
-0.68
esgue
-0.68
Positive Logits
of
1.57
των
0.83
agisse
0.83
ardless
0.79
Lors
0.76
فريبيس
0.74
Beware
0.74
]%
0.73
apesar
0.73
يتيمه
0.72
Activations Density 0.740%