Non-redundant protein sequence sets