INDEX
Explanations
phrases used for emphasizing innovation, importance, choices, and system improvements
phrases indicating actions or events that are impactful or significant
New Auto-Interp
Negative Logits
etheless
-0.83
Finally
-0.69
anything
-0.66
nonetheless
-0.65
iren
-0.64
pei
-0.64
yip
-0.63
EStream
-0.63
rentices
-0.61
patrick
-0.61
POSITIVE LOGITS
tremendous
0.68
inaccurate
0.66
great
0.66
omical
0.65
untrue
0.62
gorgeous
0.61
immense
0.60
ometric
0.59
visually
0.58
aest
0.58
Activations Density 0.081%