Explanations
the word "Instead"
instances of the word "instead" and phrases that suggest alternatives
Negative Logits
Token     Logit
Can       -0.64
Orig      -0.63
Se        -0.63
Frequ     -0.61
Hazard    -0.61
Tong      -0.60
Log       -0.59
Swe       -0.59
Family    -0.59
Fall      -0.59
Positive Logits
Token          Logit
Instead        3.01
Rather         2.26
instead        1.72
Nonetheless    1.53
Nevertheless   1.49
Fortunately    1.45
Luckily        1.44
Therefore      1.39
Moreover       1.39
Thankfully     1.38
Activations Density: 0.044%
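
The page does not say how the density figure is computed. A minimal sketch, assuming it is the fraction of sampled token positions on which the feature activation is above zero. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and chosen only to illustrate the arithmetic: under this assumption, 0.044% corresponds to about 44 active positions per 100,000 tokens.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions on which the
    feature fires at all. The source page does not confirm this definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 44 of them active.
sample = [0.0] * 99_956 + [1.5] * 44
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.044%
```

The threshold defaults to zero so that any positive activation counts as active; a dashboard that uses a different cutoff would report a different density for the same data.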