INDEX
Explanations
sentences or sentence fragments
references to sentences and their structure
New Auto-Interp
Negative Logits
oppable
-0.84
jri
-0.82
ebus
-0.80
asio
-0.80
romeda
-0.76
steamapps
-0.75
kus
-0.75
OWN
-0.75
IED
-0.74
edIn
-0.74
POSITIVE LOGITS
uttered
1.10
snippet
0.99
describing
0.96
typed
0.95
snippets
0.94
phrases
0.93
quotation
0.91
spoken
0.89
mith
0.89
highlighting
0.87
Activations Density 0.091%