INDEX
Explanations
words related to instructions, maintenance, comparison, and clarification
terms related to formal processes and evaluations
New Auto-Interp
Negative Logits
olated
-0.68
apa
-0.68
anon
-0.65
amaru
-0.63
former
-0.63
habi
-0.62
mare
-0.59
ivan
-0.58
inese
-0.56
gall
-0.56
POSITIVE LOGITS
purposes
2.33
sake
1.88
reasons
1.57
purpose
1.23
ummies
1.06
reason
0.97
Reasons
0.91
occasions
0.82
emergencies
0.80
eternity
0.78
Activations Density 0.373%