INDEX
Explanations
phrases that end with 'ed' or ',' with high activation values
instances of commas followed by varying numerical or comparative expressions
Negative Logits
Founding    -0.72
aha         -0.70
¬¼          -0.66
ahu         -0.65
è£ħ         -0.65
ãģł         -0.65
inar        -0.64
asio        -0.62
ahs         -0.62
swick       -0.61
Positive Logits
albeit      1.03
whereas     1.01
except      0.97
although    0.96
however     0.95
but         0.91
owing       0.91
costing     0.89
according   0.89
prompting   0.88
Activations Density 0.287%