INDEX
Explanations
instances of the word "simple" or related forms that convey simplicity
Negative Logits
Inscrivez       -0.50
Karriere        -0.49
tenure          -0.46
HostException   -0.44
befind          -0.44
Rptr            -0.43
InStock         -0.43
prestaciones    -0.43
notamment       -0.43
Tenure          -0.43
Positive Logits
simple          1.16
Simple          1.09
Simple          1.08
simple          1.07
SIMPLE          1.02
semplici        0.98
SIMPLE          0.98
simples         0.97
einfachen       0.91
simpl           0.90
Activations Density 0.038%
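
The density figure above can be read as the share of token positions on which the feature fires. The sketch below is only an illustration under that assumption: the function name `activation_density`, the zero threshold, and the sample data are hypothetical and not taken from the dashboard, which may compute the value differently.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: 100,000 token positions, 38 of which activate the feature.
acts = [0.0] * 100_000
for i in range(38):
    acts[i * 2500] = 1.0  # made-up nonzero activation values

density = activation_density(acts)
print(f"{density:.3%}")
```

With 38 active positions out of 100,000, the sketch yields a density on the same scale as the one reported here, which shows how sparse such a feature is: it is silent on the overwhelming majority of tokens.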