Explanations
the term "reality" in various contexts discussing truth and perception
Negative Logits
est        -0.18
esian      -0.18
ering      -0.16
ered       -0.15
annon      -0.15
uted       -0.15
Activity   -0.15
Activity   -0.15
our        -0.15
คร         -0.15
Positive Logits
istically  0.23
igned      0.22
check      0.21
-life      0.20
istical    0.19
Check      0.19
ignment    0.19
itious     0.18
ITY        0.18
CHECK      0.18
Activations Density: 0.022%