INDEX
Explanations
names of specific institutes and universities
occurrences of the term "alk" and its derivatives
Negative Logits
curfew      -0.73
cess        -0.70
kins        -0.69
icip        -0.64
millenn     -0.63
CCC         -0.60
ESCO        -0.59
ENTS        -0.58
SPONSORED   -0.58
rica        -0.58
Positive Logits
gren        1.06
hoff        1.04
ength       1.04
lund        1.00
strom       0.97
berg        0.93
heimer      0.92
enger       0.88
opian       0.87
aer         0.87
Activations Density: 0.057%