INDEX
Explanations
phrases indicating conditional situations
the repeated use of the phrase "it" in various contexts
New Auto-Interp
Negative Logits
holding
-0.64
Priv
-0.62
package
-0.60
estimating
-0.59
quartered
-0.58
caution
-0.58
phia
-0.58
ight
-0.57
Trouble
-0.57
legends
-0.56
POSITIVE LOGITS
chy
1.11
happens
1.00
alian
0.98
rains
0.98
ain
0.97
mattered
0.94
hurts
0.94
happened
0.92
unes
0.91
wasn
0.89
Activations Density 0.102%