INDEX
Explanations
the word "probably"
the word "probably" indicating uncertainty or speculation
New Auto-Interp
Negative Logits
rients
-0.60
Held
-0.59
];
-0.58
ento
-0.57
Seeking
-0.57
ins
-0.56
powers
-0.56
iegel
-0.56
ignant
-0.55
ween
-0.55
POSITIVE LOGITS
probably
3.20
probably
2.63
doubtless
2.36
undoubtedly
2.13
definitely
2.08
Probably
2.06
likely
2.03
certainly
1.98
Probably
1.91
presumably
1.91
Activations Density 0.022%