INDEX
Explanations
repetitions of the word "fact."
New Auto-Interp
Negative Logits
illac
-0.15
wor
-0.14
XL
-0.14
Injector
-0.14
.habbo
-0.14
EventListener
-0.14
then
-0.13
(strtolower
-0.13
lectron
-0.13
stras
-0.13
POSITIVE LOGITS
fact
0.30
Fact
0.21
itious
0.19
Fact
0.19
that
0.18
izr
0.18
fact
0.18
bahwa
0.18
uality
0.17
ually
0.17
Activations Density 0.018%