INDEX
Explanations
strings of characters that may not form coherent words or phrases
separators, punctuation, and symbols typically used in formatting content
New Auto-Interp
Negative Logits
telling
-0.90
acknowledged
-0.87
affirm
-0.83
headlines
-0.83
responding
-0.83
increasingly
-0.82
prompted
-0.82
threatened
-0.82
belie
-0.82
consequences
-0.81
POSITIVE LOGITS
Micro
1.38
Amb
1.34
Mini
1.34
Royal
1.33
Cola
1.31
Sem
1.31
Gall
1.30
Dual
1.30
Double
1.30
Gal
1.30
Activations Density 0.324%