Explanations
the word "only" with a focus on emphasizing limitations or exclusivity
phrases expressing the idea of something being partially complete or insufficient
Negative Logits
Token     Logit
ros       -0.74
ducers    -0.66
idon      -0.63
rigan     -0.63
wealth    -0.62
rosis     -0.61
rote      -0.61
insula    -0.60
atana     -0.60
hement    -0.59
Positive Logits
Token        Logit
marginally   1.08
kidding      0.79
scratched    0.78
temporary    0.73
partially    0.72
scratching   0.68
allowed      0.67
accessible   0.67
temporarily  0.66
indirectly   0.66
Activations Density 0.064%