INDEX
Explanations
phrases indicating contrast or change in direction
New Auto-Interp
Negative Logits
acus
-0.16
avig
-0.15
ãĤ¤ãĥ¤
-0.14
uele
-0.14
aje
-0.14
elps
-0.13
291
-0.13
bih
-0.13
ongoing
-0.13
lat
-0.13
POSITIVE LOGITS
/fwlink
0.23
ober
0.18
VERN
0.18
ÅĽcie
0.16
tangent
0.16
route
0.15
SSIP
0.15
дело
0.15
itant
0.15
broke
0.15
Activations Density 0.101%