INDEX
Explanations
adjectives likely related to the concept of something being genuine or authentic
references to the concept of reality
Negative Logits
Token         Logit
ammunition    -0.67
prey          -0.63
initiative    -0.61
ens           -0.60
annex         -0.59
heading       -0.58
cover         -0.57
slide         -0.57
rate          -0.57
boil          -0.56
Positive Logits
Token         Logit
real          4.41
Real          2.28
reality       1.84
REAL          1.82
real          1.67
actual        1.63
Real          1.51
really        1.22
oreal         1.15
true          1.13
Activations Density 0.016%
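As a rough illustration of the density figure above, the sketch below assumes that activation density means the share of tokens on which the feature's activation is greater than zero; the page itself does not state the definition. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and were not taken from the dashboard.

```python
from typing import Iterable


def activation_density(activations: Iterable[float], threshold: float = 0.0) -> float:
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" is the fraction of tokens on which the feature fires.
    """
    values = list(activations)
    if not values:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in values if a > threshold)
    return 100.0 * active / len(values)


# Hypothetical data: 12,500 tokens, of which 2 activate the feature.
acts = [0.0] * 12_500
acts[17] = 3.2
acts[4_001] = 0.9

density = activation_density(acts)
print(f"{density:.3f}%")
```

Under this reading, a density of 0.016% would correspond to the feature firing on roughly 16 of every 100,000 tokens.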