INDEX
Explanations
expressions of positive emotions or sentiments
Expressing personal feelings or opinions
glad we were able to
New Auto-Interp
Negative Logits
niestety
-0.62
fascinating
-0.62
sadly
-0.61
encji
-0.59
famously
-0.58
notoriously
-0.57
unfortunately
-0.56
enviable
-0.55
hilarious
-0.55
unbearable
-0.54
POSITIVE LOGITS
finally
0.80
Finally
0.72
Finally
0.71
finally
0.69
finalmente
0.68
able
0.66
AssemblyTitle
0.65
ویکیپدیا
0.65
GeoNames
0.63
наконец
0.63
Activations Density 0.223%