Explanations
instances where two specific words are mentioned in the same context, as well as words related to numerical percentages
phrases related to value and quantity
Negative Logits
Token               Logit
exting              -0.65
jah                 -0.65
ardless             -0.64
ãĤ¦ãĤ¹              -0.61
farious             -0.60
////////////////    -0.60
orously             -0.60
razil               -0.58
istically           -0.58
anity               -0.58
Positive Logits
Token               Logit
NBA                 0.61
Knicks              0.61
Orioles             0.60
WWE                 0.60
¶                   0.58
.)                  0.57
Tsarnaev            0.57
MLB                 0.57
BART                0.56
MMA                 0.56
Activations Density: 2.214%