INDEX
Explanations
expressions related to programming languages and operations
phrases involving explanatory clauses or additional information
New Auto-Interp
Negative Logits
ocese
-0.65
andals
-0.64
realDonaldTrump
-0.62
rero
-0.61
ificent
-0.61
ļ
-0.61
Dynasty
-0.58
leground
-0.57
raved
-0.57
azeera
-0.57
POSITIVE LOGITS
whereas
1.38
whereby
1.33
namely
1.30
meaning
1.22
ie
1.20
wherein
1.19
aka
1.09
which
1.06
thereby
1.05
implying
1.04
Activations Density 0.464%