INDEX
Explanations
terms or phrases related to definitions or interpretations
instances of the word "definition" and its associated context
New Auto-Interp
Negative Logits
rop
-0.62
imentary
-0.61
Antar
-0.61
ili
-0.61
Elect
-0.60
roph
-0.60
itch
-0.59
orld
-0.59
Pradesh
-0.59
Observer
-0.58
POSITIVE LOGITS
inition
1.11
definition
1.07
definitions
1.06
initions
1.01
definition
0.93
defines
0.89
Definition
0.88
witz
0.88
naire
0.83
REDACTED
0.81
Activations Density 0.010%