INDEX
Explanations
mentions of the word "truth."
occurrences of the word "truth."
New Auto-Interp
Negative Logits
Wolves
-0.68
zzi
-0.68
ATIONS
-0.66
Apocalypse
-0.65
hiba
-0.64
backer
-0.63
Zombie
-0.62
Goods
-0.61
Attention
-0.60
eland
-0.59
POSITIVE LOGITS
anasia
1.10
osate
0.94
lessness
0.94
lessly
0.94
urst
0.93
ilater
0.90
ouse
0.89
umbing
0.89
sburg
0.88
reys
0.88
Activations Density 0.042%