INDEX
Explanations
phrases containing the word "so"
the phrase "so-called" in various contexts
Negative Logits
theless         -0.69
NTS             -0.65
Mens            -0.58
takedown        -0.56
upside          -0.56
MFT             -0.56
stag            -0.56
Adjust          -0.56
whistleblower   -0.55
bulletin        -0.55
Positive Logits
apy             1.37
oths            1.25
aps             1.20
iled            1.20
bered           1.07
vi              1.04
iling           1.03
pping           1.01
aring           1.00
ire             0.99
Activations Density: 0.036%