Explanations
phrases expressing frustration or exasperation
Negative Logits
Token            Logit
^(@)             -0.67
Искәрмәләр       -0.66
)                -0.59
出版年           -0.59
IndentedString   -0.58
.}~\             -0.57
లాలు             -0.57
contentLoaded    -0.57
                 -0.57
NUMX             -0.56
Positive Logits
Token            Logit
hell             1.52
fuck             1.31
heck             1.19
HELL             1.13
FUCK             1.05
hell             1.03
Hell             1.02
Hell             0.97
fuck             0.97
shit             0.97
Activations Density 0.121%