INDEX
Explanations
phrases related to distance or extent
references to progress or distance traveled
NEGATIVE LOGITS
variable         -0.70
Ò                -0.67
é¾               -0.66
çļ               -0.66
advertisement    -0.65
Buzz             -0.65
200000           -0.62
ciating          -0.61
Parenthood       -0.61
OTOS             -0.60
POSITIVE LOGITS
tread            0.82
eele             0.79
encro            0.78
toward           0.77
slopes           0.75
towards          0.73
distances        0.73
pend             0.71
descent          0.71
reaching         0.71
Activations Density 0.158%