INDEX
Explanations
the phrase "all of it" or variations of it
phrases that express absolutes or totality
New Auto-Interp
Negative Logits
SHIP
-0.69
aminer
-0.69
lict
-0.66
DF
-0.59
IDS
-0.59
incre
-0.58
Mellon
-0.58
nant
-0.57
edin
-0.57
licts
-0.56
POSITIVE LOGITS
ocating
0.91
iance
0.86
uring
0.85
ahu
0.82
usive
0.80
ogene
0.80
iter
0.80
ocation
0.78
ergic
0.77
owing
0.75
Activations Density 0.072%