INDEX
Explanations
instances of the word "triple" related to numerical values or scores
New Auto-Interp
Negative Logits
sbm
-0.70
Downloadha
-0.69
istry
-0.68
latest
-0.66
ears
-0.65
hammad
-0.65
papers
-0.65
earing
-0.64
ega
-0.64
Dialogue
-0.63
POSITIVE LOGITS
plet
1.03
plets
1.00
ty
0.91
ts
0.88
header
0.84
toe
0.81
leans
0.78
teen
0.78
reme
0.76
ple
0.76
Activations Density 0.102%