INDEX
Explanations
phrases relating to comparison or inclusion as well as some verbs
phrases indicating comparisons or contrasts
New Auto-Interp
Negative Logits
Ãĥ
-0.43
usk
-0.43
ÂŃ
-0.42
���
-0.42
Gra
-0.41
Sport
-0.40
Aren
-0.40
Euro
-0.38
ath
-0.38
egu
-0.38
POSITIVE LOGITS
embodiments
0.57
secondly
0.52
thence
0.50
optionally
0.50
verbs
0.49
hereafter
0.49
ital
0.48
endif
0.47
specifying
0.46
recomp
0.46
Activations Density 1.068%