INDEX
Explanations
phrases related to evidence or confirmation
the term "proven" in various contexts related to validation or evidence
New Auto-Interp
Negative Logits
adish
-0.75
letal
-0.72
ifle
-0.72
idays
-0.69
iewicz
-0.67
squats
-0.66
eeper
-0.64
umbn
-0.64
paio
-0.64
onductor
-0.62
POSITIVE LOGITS
ãĥ¼ãĥĨ
0.97
proven
0.92
iary
0.84
س
0.80
ãĤ¤ãĥĪ
0.78
refuted
0.78
ingen
0.78
Ô
0.75
debunked
0.75
\\\\\\\\
0.74
Activations Density 0.015%