INDEX
Explanations
adjectives and phrases related to evaluation or comparison
concepts related to distinctions between theory and fact or reality
New Auto-Interp
Negative Logits
Ĭ±
-0.73
Cosponsors
-0.69
Divide
-0.65
bryce
-0.64
Decre
-0.64
apego
-0.63
thora
-0.62
©¶æ¥µ
-0.61
Delay
-0.61
å¦
-0.61
POSITIVE LOGITS
actual
1.99
real
1.51
reality
1.43
Actual
1.35
actual
1.32
tangible
1.24
actually
1.21
realities
1.21
genuine
1.20
real
1.19
Activations Density 0.649%