g-tries: an efficient data structure for discovering network motifs

In this paper we propose a novel specialized data structure that we call g-trie, designed to deal with collections of subgraphs. The main conceptual idea is akin to a prefix tree in the sense that we take advantage of common topology by constructing a multiway tree where the descendants of a node sh...

ver descrição completa

Detalhes bibliográficos
Autor principal: Ribeiro, P (author)
Outros Autores: Silva, F (author)
Formato: book
Idioma:eng
Publicado em: 2010
Assuntos:
Texto completo:https://hdl.handle.net/10216/101828
País:Portugal
Oai:oai:repositorio-aberto.up.pt:10216/101828
Descrição
Resumo:In this paper we propose a novel specialized data structure that we call g-trie, designed to deal with collections of subgraphs. The main conceptual idea is akin to a prefix tree in the sense that we take advantage of common topology by constructing a multiway tree where the descendants of a node share a common substructure. We give algorithms to construct a g-trie, to list all stored subgraphs, and to find occurrences on another graph of the subgraphs stored in the g-trie. We evaluate the implementation of this structure and its associated algorithms on a set of representative benchmark biological networks in order to find network motifs. To assess the efficiency of our algorithms we compare their performance with other known network motif algorithms also implemented in the same common platform. Our results show that indeed, g-tries are a feasible, adequate and very efficient data structure for network motifs discovery, clearly outperforming previous algorithms and data structures. (c) 2010 ACM.