Explanations
questions asking if something is truly necessary or being questioned
phrases and questions related to the necessity and impact of various controversial topics
Negative Logits
displayText    -0.76
geoning        -0.69
wx             -0.64
©¶æ¥µ          -0.63
catentry       -0.62
iferation      -0.62
agraph         -0.61
CTV            -0.60
yk             -0.59
TB             -0.59
Positive Logits
?:             1.12
?",            1.03
?              1.02
?'             0.98
?).            0.96
...?           0.96
?????          0.94
?".            0.94
?),            0.93
anymore        0.89
Activations Density 0.308%