INDEX
Explanations
concerns or worries in the text
concerns and worries expressed in relation to various topics
New Auto-Interp
Negative Logits
hesis
-0.85
SPONSORED
-0.79
hiba
-0.78
oun
-0.74
redits
-0.73
Written
-0.72
zynski
-0.71
aughtered
-0.71
Lens
-0.70
chnology
-0.69
POSITIVE LOGITS
preserving
0.99
getting
0.89
losing
0.88
protecting
0.87
improving
0.86
whether
0.84
escaping
0.82
mortality
0.81
saving
0.81
maintaining
0.80
Activations Density 0.060%