Explanations
instances of excitement or emotional responses
Negative Logits
Token        Logit
ltk          -0.56
CrossRef     -0.56
NDEBUG       -0.54
abestanden   -0.54
.~\          -0.53
'+':         -0.52
}?>          -0.52
antd         -0.52
'-':         -0.51
Zy           -0.51
Positive Logits
Token        Logit
wow          1.76
boy          1.62
Wow          1.59
WOW          1.58
wow          1.56
Wow          1.50
WOW          1.44
oh           1.41
OMG          1.34
holy         1.29
Activations Density: 0.126%
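
A density figure like the one above can be read as the share of tokens on which the feature fires. The sketch below works under that assumption: it treats density as the percentage of tokens whose activation exceeds a threshold (0 by default). The function name `activation_density` and the threshold parameter are hypothetical, introduced only for illustration; the source does not state how its figure was computed.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the fraction of tokens on which the
    feature is active, expressed as a percentage.
    """
    total = len(activations)
    if total == 0:
        raise ValueError("activations must not be empty")
    # Count tokens where the feature is considered active.
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / total


if __name__ == "__main__":
    # Toy example: the feature fires on 1 of 8 tokens.
    sample = [0.0, 0.0, 0.0, 2.3, 0.0, 0.0, 0.0, 0.0]
    print(f"{activation_density(sample):.3f}%")
```

Under this reading, a density of 0.126% would mean the feature is active on roughly 1 token in every 800.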