INDEX
Explanations
occurrences of the word "Journal" and related terms indicating academic or professional publications
New Auto-Interp
Negative Logits
ewn
-0.17
asser
-0.17
zk
-0.16
ext
-0.15
coe
-0.15
ertools
-0.15
785
-0.15
ulos
-0.15
ater
-0.15
ets
-0.15
POSITIVE LOGITS
istic
0.31
ists
0.28
ism
0.24
isms
0.23
istics
0.23
ize
0.23
istically
0.23
isted
0.22
ISM
0.21
izing
0.21
Activations Density 0.020%