INDEX
Explanations
sentences or phrases that express a lack of confidence or uncertainty surrounding a situation
New Auto-Interp
Negative Logits
Flavoring
-0.74
kefeller
-0.63
éŃĶ
-0.63
Merit
-0.60
ickets
-0.59
osate
-0.56
advertising
-0.55
OPEC
-0.55
Cosponsors
-0.54
Reboot
-0.54
POSITIVE LOGITS
!?
0.78
????????
0.75
?!
0.71
????
0.70
???
0.67
...?
0.64
huh
0.64
course
0.60
âĶĢâĶĢâĶĢâĶĢ
0.60
"?
0.59
Activations Density 0.564%