INDEX
Explanations
positive readiness and agreement
New Auto-Interp
Negative Logits
fucking
0.59
কারণে
0.43
kvůli
0.42
怎么办
0.39
Damn
0.39
Fuck
0.39
références
0.39
damn
0.38
」、
0.38
}{|0.38
POSITIVE LOGITS
:)
1.60
🙂
1.50
😊
1.48
:-)
1.37
:)
1.32
😊
1.32
😄
1.27
🙂
1.27
😀
1.26
:-)
1.23
Activations Density 0.123%