INDEX
Explanations
phrases and terms related to urgency and necessity, particularly in the context of personal freedom and choice
Negative Logits
__':               -0.72
الحره              -0.61
ยว                 -0.56
startY             -0.56
IsContent          -0.56
Unnamed            -0.55
stości             -0.54
XmlAccessorType    -0.54
__':               -0.54
__":               -0.54
Positive Logits
!!!)               0.70
!!!!!!             0.70
!!!!!              0.67
ONLY               0.67
!)                 0.66
CANNOT             0.65
NEVER              0.65
(!)                0.64
!!!!!!!!           0.64
!!!!!!!            0.64
Activations Density 0.195%