INDEX
Explanations
phrases indicating comparison or contrast
phrases including the term "anything" and its variations
New Auto-Interp
Negative Logits
ãĥĸ
-0.73
sbm
-0.71
ãĤ¤ãĥĪ
-0.68
Manufacturer
-0.68
ãĥ³ãĤ¸
-0.68
mud
-0.65
aceae
-0.64
Gong
-0.63
Rumble
-0.62
ãĥ£
-0.60
POSITIVE LOGITS
cohesion
0.64
leakage
0.63
feder
0.59
succeeds
0.59
assador
0.59
happens
0.59
ught
0.58
Missing
0.57
intel
0.57
ado
0.57
Activations Density 0.093%