INDEX
Explanations
expressions of frustration or strong emotions
New Auto-Interp
Negative Logits
OGND
-0.70
auffi
-0.66
HasAnnotation
-0.65
ProtoMessage
-0.64
Искәрмәләр
-0.61
Források
-0.60
незавершена
-0.59
__':
-0.59
kaarangay
-0.58
pozdrawiam
-0.57
POSITIVE LOGITS
fucking
2.35
goddamn
2.19
damn
2.19
fucking
2.11
fuckin
2.10
freaking
2.04
Fucking
2.00
freakin
1.99
damned
1.97
frig
1.90
Activations Density 0.446%