INDEX
Explanations
quotes and statements from discussions or debates, particularly related to controversial or sensitive topics such as politics
Negative Logits
brance            -0.74
ghai              -0.73
upstream          -0.71
Torrent           -0.69
soDeliveryDate    -0.69
Maiden            -0.68
deforestation     -0.66
士                -0.65
mortality         -0.65
wings             -0.65
Positive Logits
rhet              1.05
sarcast           1.00
applause          0.99
incred            0.98
rebutt            0.97
Kimmel            0.97
moderator         0.95
chuck             0.94
laughter          0.91
condesc           0.91
Activations Density: 0.505%