INDEX
Explanations
phrases indicating significance or importance
frequent phrases that indicate emphasis, specificity, or attributes in descriptions
New Auto-Interp
Negative Logits
mare
-0.76
antioxid
-0.73
interstitial
-0.72
arta
-0.71
mares
-0.71
VIDEOS
-0.70
bara
-0.69
awa
-0.69
ethy
-0.65
Dialogue
-0.62
POSITIVE LOGITS
originally
1.07
instrumental
0.98
initially
0.91
last
0.84
unsuccessful
0.84
conceived
0.78
successful
0.77
earlier
0.77
previously
0.75
gracious
0.74
Activations Density 0.544%