INDEX
Explanations
accusations or claims made in a sentence
the word "that" and its various repetitions indicating claims or assertions within a text
New Auto-Interp
Negative Logits
aukee
-0.99
ãĤ´ãĥ³
-0.85
izont
-0.79
alt
-0.78
Laughs
-0.77
Wide
-0.76
think
-0.75
WAYS
-0.74
unction
-0.73
inki
-0.72
POSITIVE LOGITS
they
0.90
improper
0.79
inadequate
0.78
soever
0.78
improperly
0.74
defendants
0.74
he
0.74
faulty
0.73
smugglers
0.73
although
0.71
Activations Density 0.207%