INDEX
Explanations
phrases indicating sporadic or occasional events
phrases related to time and frequency
Negative Logits
tnc             -0.65
SourceFile      -0.65
composition     -0.64
Flavoring       -0.62
horn            -0.62
tumblr          -0.60
constitu        -0.59
similarity      -0.58
manipulation    -0.58
portfolios      -0.57
Positive Logits
ciating         0.78
ench            0.68
iatus           0.63
ORED            0.62
here            0.62
hew             0.62
recons          0.60
wu              0.59
usted           0.58
Chaff           0.58
Activations Density: 0.419%