代码之家  ›  专栏  ›  技术社区  ›  psoares

Phylo BioPython建筑树

  •  6
  • psoares  · 技术社区  · 14 年前


    到目前为止,我所做的是这张图片: alt text

    序列对齐的文件: file
    具有生成树的距离的文件: file

        fh = open(MEDIA_ROOT + "groupsnp.txt")    
        list_size = {}
        for line in fh:
            if '>' in line:
                labels = line.split('>')
                label = labels[-1]
                label = label.split()
                num = line.split('-')
                size = num[-1]
                size = size.split()
                for lab in label:
                    for number in size:
                        list_size[lab] = int(number)
    
        a = array(list_size.values())
    

    但是数组是任意的,我想把正确的节点大小放入正确的节点中,我尝试了如下操作:

             for elem in list_size.keys():
                 if labels == elem:
                     Phylo.draw_graphviz(tree_xml, prog="neato", node_size=a)
    

    但是当我使用if语句时什么也没有出现。

    不管怎么说?

    谢谢大家

    1 回复  |  直到 14 年前
        1
  •  8
  •   rwilliams    14 年前

    我终于开始工作了。基本前提是你要使用 labels/nodelist 建立你的 node_sizes . 这样它们就适当地相互关联了。我确信我遗漏了一些重要的选项来使树看起来100%,但是看起来节点大小显示正确。

    #basically a stripped down rewrite of Phylo.draw_graphviz
    import networkx, pylab
    from Bio import Phylo
    
    
    #taken from draw_graphviz
    def get_label_mapping(G, selection): 
        for node in G.nodes(): 
            if (selection is None) or (node in selection): 
                try: 
                    label = str(node) 
                    if label not in (None, node.__class__.__name__): 
                        yield (node, label) 
                except (LookupError, AttributeError, ValueError): 
                    pass
    
    
    kwargs={}
    tree = Phylo.read('tree.dnd', 'newick')
    G = Phylo.to_networkx(tree)
    Gi = networkx.convert_node_labels_to_integers(G, discard_old_labels=False)
    
    node_sizes = []
    labels = dict(get_label_mapping(G, None))
    kwargs['nodelist'] = labels.keys()
    
    #create our node sizes based on our labels because the labels are used for the node_list
    #this way they should be correct
    for label in labels.keys():
        if str(label) != "Clade":
            num = label.name.split('-')
            #the times 50 is just a guess on what would look best
            size = int(num[-1]) * 50
            node_sizes.append(size)
    
    kwargs['node_size'] = node_sizes
    posi = networkx.pygraphviz_layout(Gi, 'neato', args='') 
    posn = dict((n, posi[Gi.node_labels[n]]) for n in G) 
    
    networkx.draw(G, posn, labels=labels, node_color='#c0deff', **kwargs)
    
    pylab.show()
    

    结果树 alt text