INDEX
Explanations
references to regularity or routines
references to frequency or consistency
Negative Logits
Sov        -0.80
lda        -0.73
adish      -0.67
kamp       -0.65
UST        -0.65
Drug       -0.65
arta       -0.65
apego      -0.64
IDA        -0.64
INGTON     -0.63
Positive Logits
ity         1.17
ised        1.00
cy          0.98
isation     0.97
sized       0.91
weekday     0.90
intervals   0.90
ITY         0.89
ization     0.89
occurrence  0.88
Activations Density 0.014%