INDEX
Explanations
references to average values in data
New Auto-Interp
Negative Logits
ulemon
-0.65
DialogInterface
-0.62
sp
-0.60
Murdoch
-0.56
ed
-0.56
hasMoreElements
-0.55
sk
-0.55
emb
-0.54
Kirk
-0.54
führt
-0.54
POSITIVE LOGITS
average
3.23
average
3.04
Average
3.03
Average
2.96
AVERAGE
2.85
averages
2.64
AVERAGE
2.62
averaged
2.55
avg
2.46
averaging
2.46
Activations Density 0.078%