INDEX
Explanations
text related to criticizing, commenting, or making negative evaluations about various topics or individuals
the word "about" and its related contexts
New Auto-Interp
Negative Logits
OGR
-0.87
hesis
-0.86
rift
-0.81
quez
-0.80
chnology
-0.77
KC
-0.72
hiba
-0.72
³³³³
-0.71
à¥
-0.69
waters
-0.69
POSITIVE LOGITS
how
0.97
whether
0.90
reforming
0.79
respecting
0.74
why
0.74
resolving
0.73
improving
0.72
migrating
0.70
changing
0.70
overcoming
0.70
Activations Density 0.125%