INDEX
Explanations
phrases related to social events and actions
punctuation marks and their occurrences in various contexts
NEGATIVE LOGITS
uber         -0.82
enary        -0.79
iple         -0.77
ibo          -0.76
UF           -0.76
isi          -0.71
yon          -0.69
rius         -0.67
raved        -0.66
¬¼           -0.66
POSITIVE LOGITS
whereas       1.19
although      1.18
albeit        1.15
though        1.13
but           1.05
which         1.02
however       1.00
namely        0.96
favoring      0.95
meanwhile     0.94
Activations Density 0.724%
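
The density figure is not defined on this page. A common reading, assumed here, is the fraction of sampled tokens on which the feature's activation is non-zero. The sketch below illustrates that reading only: the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and do not come from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature fires.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1000 tokens, of which 7 have a non-zero activation.
acts = [0.0] * 993 + [0.5, 1.2, 0.3, 2.0, 0.9, 0.1, 0.7]
density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.700%
```

Under this reading, a density of 0.724% would mean the feature fires on roughly 7 of every 1000 tokens.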