Explanations
sentences related to catastrophic events or global phenomena
Negative Logits
Token         Logit
onite         -0.77
mate          -0.77
calf          -0.73
ebus          -0.70
iris          -0.67
mates         -0.66
uana          -0.66
ube           -0.66
unspecified   -0.66
remorse       -0.65
Positive Logits
Token          Logit
Whether        1.29
Often          1.21
Especially     1.20
Particularly   1.17
Typically      1.15
However        1.14
Many           1.11
Few            1.10
Yet            1.07
Although       1.07
Activations Density: 0.437%