INDEX
Explanations
the word "fact" and related words that express that something is real
New Auto-Interp
Negative Logits
fact
-2.27
fact
-1.84
Fact
-1.64
Fact
-1.63
Tatsache
-1.45
FACT
-1.27
Facts
-1.09
fakta
-1.09
hecho
-1.07
факт
-1.05
POSITIVE LOGITS
PhysRevLett
0.67
muñ
0.57
Sinon
0.55
getRule
0.54
Hentet
0.53
zzar
0.52
bologna
0.52
CreateIndex
0.52
uties
0.51
dress
0.50
Activations Density 1.785%