INDEX
Explanations
references to numerical strength or forces in a narrative context
New Auto-Interp
Negative Logits
ế
-0.17
klady
-0.16
elves
-0.16
deposit
-0.16
avr
-0.15
ìŀ¡
-0.15
orbit
-0.15
deposit
-0.15
íĥĦ
-0.15
åħĥ
-0.15
POSITIVE LOGITS
strength
0.15
edge
0.14
OfSize
0.14
box
0.14
against
0.14
æ¬
0.14
strength
0.14
/lg
0.14
Conserv
0.14
ת
0.14
Activations Density 0.346%