INDEX
Explanations
scores or ratings within a text
references to scoring or ratings in various contexts
Negative Logits
detachment     -0.64
xon            -0.63
ĸļ             -0.59
etheless       -0.59
Wonderland     -0.59
Workers        -0.58
Cellular       -0.58
fascination    -0.58
compulsion     -0.58
agan           -0.57
Positive Logits
card           1.39
cards          1.25
ific           1.06
keeper         0.99
keepers        0.94
heet           0.87
rs             0.86
keeping        0.86
emi            0.82
ifications     0.81
Activations Density: 0.032%