Explanations
terms related to stability and change over time
Negative Logits
Token        Logit
.DropTable   -0.17
aggress      -0.16
aat          -0.15
Goose        -0.15
ếu           -0.14
assembly     -0.14
Radians      -0.14
away         -0.14
ç¨           -0.14
one          -0.13
Positive Logits
Token        Logit
867          0.17
halt         0.17
ABEL         0.15
plib         0.15
pond         0.15
McKenzie     0.14
íļĮ          0.14
Reeves       0.14
dok          0.14
rophe        0.13
Activations Density: 0.286%