INDEX
Explanations
phrases or sentences that introduce a new idea or aspect to a conversation
affirmations or expressions of agreement
New Auto-Interp
Negative Logits
İĭ
-0.70
flair
-0.64
âĹ¼
-0.64
illary
-0.63
unal
-0.62
MX
-0.61
Parables
-0.60
wedge
-0.58
lightsaber
-0.58
garage
-0.58
POSITIVE LOGITS
esley
0.99
come
0.89
ington
0.78
ards
0.76
espie
0.76
Coin
0.68
FTWARE
0.68
tenance
0.67
STON
0.67
suited
0.67
Activations Density 0.024%