INDEX
Explanations
phrases indicating a comparison, often suggesting a superior or dominant position
the word "out" in various contexts
New Auto-Interp
Negative Logits
士
-0.72
otaur
-0.65
Municip
-0.63
ahime
-0.63
Symbol
-0.61
Brach
-0.60
idate
-0.56
vernment
-0.54
Minotaur
-0.54
Decay
-0.53
POSITIVE LOGITS
fitted
1.12
loud
0.95
number
0.93
posts
0.93
fitting
0.92
lasting
0.88
stretched
0.86
doors
0.86
last
0.86
range
0.83
Activations Density 0.066%