INDEX
Explanations
strongly opinionated statements or beliefs
references to beliefs or ideologies that are framed as problematic or dangerous
Negative Logits
achu      -0.76
Edit      -0.74
oop       -0.73
aido      -0.72
options   -0.71
acid      -0.71
ents      -0.70
ecided    -0.69
intent    -0.67
otte      -0.67
Positive Logits
manifestation   1.24
reflection      1.20
symptom         1.15
reminder        1.12
testament       1.07
continuation    1.02
culmination     1.02
distraction     1.01
betrayal        1.00
contradiction   0.99
Activations Density: 0.171%