INDEX
Explanations
phrases related to paying attention to small details or occurrences
references to small, significant details or nuances
New Auto-Interp
Negative Logits
etheus
-0.69
ayn
-0.67
igion
-0.63
ogh
-0.62
åĭ
-0.62
chwitz
-0.61
Roses
-0.61
xit
-0.60
hovah
-0.60
oglu
-0.59
POSITIVE LOGITS
(<
1.22
tiny
1.08
insignificant
1.06
increments
0.93
sized
0.92
tiny
0.88
smallest
0.88
manageable
0.87
tweaks
0.86
tucked
0.84
Activations Density 0.313%