INDEX
Explanations
adjectives and phrases related to various qualities or conditions
adjectives and their modifiers that describe conditions or states
New Auto-Interp
Negative Logits
};
-0.86
',
-0.86
'.
-0.83
guiName
-0.77
.",
-0.77
."
-0.76
,"
-0.74
];
-0.71
%.
-0.71
",
-0.70
POSITIVE LOGITS
)
1.89
-)
1.62
)-
1.59
?)
1.59
)'
1.59
*)
1.58
!)
1.50
)"
1.49
)/
1.47
)*
1.42
Activations Density 0.305%