INDEX
Explanations
the presence of the word "der" in various contexts
New Auto-Interp
Negative Logits
OrNil
-0.50
InjectAttribute
-0.45
consum
-0.45
guan
-0.45
recommendation
-0.44
__*/
-0.44
Guan
-0.44
recom
-0.43
developed
-0.42
NameValuePair
-0.42
POSITIVE LOGITS
der
1.59
der
0.87
Der
0.83
Der
0.79
DER
0.65
deri
0.56
DER
0.53
dert
0.52
derma
0.52
dering
0.48
Activations Density 0.060%