Explanations
adverbs that modify the manner of actions or states
Negative Logits
more    -0.60
toBe    -0.59
a       -0.54
ToBe    -0.51
alent   -0.50
aber    -0.50
quite   -0.49
serem   -0.49
blown   -0.48
แฟ      -0.47
Positive Logits
openConnection  0.94
tonode          0.84
تانيه           0.81
endregion       0.78
gonic           0.77
Dichter         0.76
uples           0.75
}*/             0.74
avía            0.73
gamesh          0.73
Activations Density: 0.209%