INDEX
Explanations
the phrase "in fact" in texts
the phrase "in fact" and its variations within the text
Negative Logits
Flavoring    -0.69
bye          -0.66
Citiz        -0.62
laure        -0.62
Crown        -0.62
illed        -0.60
ittal        -0.60
MENTS        -0.60
anon         -0.59
ution        -0.58
Positive Logits
ional        0.93
netflix      0.86
ãĥĩãĤ£       0.73
olkien       0.72
REP          0.71
akes         0.70
lie          0.67
managed      0.65
staged       0.65
opus         0.65
Activations Density 0.023%