INDEX
Explanations
instances and mentions of examples in various contexts
New Auto-Interp
Negative Logits
noir
-0.43
longtemps
-0.41
Potatoes
-0.40
Potatoes
-0.40
ardından
-0.40
marchandises
-0.39
ranath
-0.39
nieruchomości
-0.39
elétrica
-0.39
north
-0.38
POSITIVE LOGITS
Example
1.20
example
1.20
example
1.18
Example
1.17
examples
1.13
EXAMPLE
1.13
EXAMPLE
1.08
Exemple
0.99
exemple
0.99
Examples
0.98
Activations Density 0.157%