INDEX
Explanations
phrases related to religious scripture being considered authoritative
references to religious texts and their interpretations
New Auto-Interp
Negative Logits
ktop
-0.77
stunts
-0.77
NetMessage
-0.74
layoffs
-0.71
disruptions
-0.71
spikes
-0.71
Pool
-0.71
cffff
-0.70
hov
-0.69
queues
-0.69
POSITIVE LOGITS
infall
1.50
authoritative
1.45
accurate
1.38
trustworthy
1.37
valid
1.35
truthful
1.34
correct
1.32
true
1.24
truth
1.24
conclusive
1.20
Activations Density 0.382%