INDEX
Explanations
phrases that provide additional information or context
the word "although" and its variations, indicating contrast or exception
New Auto-Interp
Negative Logits
lees
-0.75
enter
-0.75
entry
-0.75
hal
-0.73
oire
-0.72
edu
-0.71
elle
-0.70
unes
-0.70
gem
-0.69
ath
-0.68
POSITIVE LOGITS
admittedly
0.94
acknowledging
0.88
interestingly
0.83
etheless
0.81
fortunately
0.80
retaining
0.78
technically
0.77
chery
0.77
tons
0.76
preferably
0.75
Activations Density 0.037%