INDEX
Explanations
instances where something is being forcefully brought to an end or coming to a striking realization
phrases that indicate impact or influence
Negative Logits
gio         -0.77
uploads     -0.73
atts        -0.71
idia        -0.70
jri         -0.66
reviewed    -0.65
phrine      -0.65
taboola     -0.64
urdue       -0.62
rss         -0.62
Positive Logits
bane        0.72
lightning   0.69
iron        0.69
ritic       0.68
road        0.67
hare        0.67
hardest     0.66
itch        0.66
stride      0.64
ortmund     0.64
Activations Density: 0.106%