INDEX
Explanations
the word "simply" and its variants, used to signal simplicity or straightforwardness in an explanation
Negative Logits
ISupport      -1.00
culturelles   -0.84
regardant     -0.84
colorés       -0.84
Gallardo      -0.83
debout        -0.82
passim        -0.82
timbangkan    -0.82
atteinte      -0.81
préfé         -0.81
Positive Logits
SIMPLE        0.97
simpleType    0.96
Simply        0.92
Simply        0.92
simply        0.86
Simple        0.86
PLY           0.85
simple        0.84
simply        0.80
er            0.77
Activations Density 0.095%