INDEX
Explanations
lexical items related to definitions and descriptions of various terms or concepts
discussions about definitions and their interpretations
New Auto-Interp
Negative Logits
iddles
-0.71
reau
-0.71
ennes
-0.69
liners
-0.66
Moves
-0.66
ADS
-0.65
DAQ
-0.65
lication
-0.65
llor
-0.65
loads
-0.64
POSITIVE LOGITS
acceptable
1.01
permissible
1.01
"
0.98
insanity
0.95
legality
0.91
'
0.90
"'
0.89
professionalism
0.88
\"
0.87
what
0.86
Activations Density 0.228%