Explanations
phrases related to proven facts or established truths
occurrences of the word "proven" and its variations, indicating evidence or validation of claims
Negative Logits
Token          Logit
alon           -0.75
eeper          -0.72
utterstock     -0.71
Shoes          -0.70
regate         -0.65
adish          -0.65
querade        -0.64
umbn           -0.64
onew           -0.64
IDES           -0.64
Positive Logits
Token          Logit
iary           1.05
proven         0.90
iferation      0.84
è£ıè           0.74
debunked       0.73
icity          0.71
factual        0.69
âĵĺ            0.68
uable          0.68
icist          0.68
Activations Density: 0.064%
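The density figure above can be read as the share of token positions on which the feature fires. The sketch below is only an illustration under that assumption: the function name `activation_density`, the zero threshold, and the toy data are all hypothetical and are not taken from the dashboard or from any particular library.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions on which the
    feature is active (activation strictly above the threshold).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 10,000 token positions, 6 of them active.
toy = [0.0] * 10_000
for i in (3, 250, 999, 4000, 7321, 9998):
    toy[i] = 1.5

density = activation_density(toy)
print(f"{density:.3%}")  # prints 0.060%
```

A density this low would mean the feature is sparse: it is silent on almost every token and fires only on a small, specific set of contexts.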