INDEX
Explanations
stone's throw and proximity
New Auto-Interp
Negative Logits
Loads
-0.81
婦人
-0.81
Wrk
-0.77
irme
-0.77
Learned
-0.77
reciate
-0.75
もちゃ
-0.75
vestidos
-0.73
quanti
-0.71
geweldig
-0.70
POSITIVE LOGITS
stone
2.36
stones
2.23
hop
1.66
stone
1.58
hair
1.38
Stone
1.38
石
1.37
stones
1.32
whis
1.31
short
1.24
Activations Density 0.015%