INDEX
Explanations
comparisons or similes
variations of the word "like" and related forms indicating comparison or resemblance
Negative Logits
Token          Logit
Nou            -0.76
ACA            -0.76
systematic     -0.68
Schools        -0.67
residency      -0.66
INA            -0.66
Interstitial   -0.65
Techniques     -0.65
trl            -0.63
fragment       -0.62
Positive Logits
Token          Logit
lihood         1.57
lik            1.10
liness         0.93
lik            0.91
ened           0.90
iour           0.88
ability        0.87
ening          0.86
likeness       0.82
liest          0.80
Activations Density: 0.004%