Abstract |
---|
A large part of online biological data now resides in the deep web. Several recent efforts have focused on integrating biological deep web data sources and providing search functionality over them. Such systems often require access to a large number of remote data sources over various communication links. Both the servers and the network links are vulnerable to congestion and failures, which can lead to unpredictable unavailability or inaccessibility and disrupt access to the information. In this paper, we propose a solution that maintains the query processing capability of an integrated biological deep web search system in the presence of unavailable or inaccessible data sources. Our solution dynamically adapts query processing when unexpected data source unavailability or inaccessibility is detected. We exploit the data redundancy found across biological deep web data sources: we incrementally generate a partial new query plan, bringing in data sources that were not in the original query plan to replace the subplan that became inaccessible. |
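
The abstract does not give the replanning algorithm itself, so the following is only a minimal sketch of the general idea it describes: when one step of a query plan becomes inaccessible, search the redundant sources for an alternative chain that produces the same attribute, and splice it in while leaving the rest of the plan untouched. The `Source` type, the attribute names, the source names (`src_A` to `src_D`), and the breadth-first search are all assumptions made for illustration, not the authors' method.

```python
from collections import deque
from dataclasses import dataclass


@dataclass(frozen=True)
class Source:
    """Hypothetical deep web source: queried with `takes`, it returns `gives`."""
    name: str
    takes: str
    gives: str


def find_subplan(sources, start, goal, unavailable):
    """Breadth-first search for the shortest chain of available sources
    that derives attribute `goal` from attribute `start`."""
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        attr, path = queue.popleft()
        if attr == goal:
            return path
        for src in sources:
            if src.name in unavailable or src.takes != attr or src.gives in seen:
                continue
            seen.add(src.gives)
            queue.append((src.gives, path + [src]))
    return None  # no redundant route exists


def heal(plan, failed, sources, unavailable):
    """Replace only the failed step of `plan`; the other steps are kept as-is."""
    i = plan.index(failed)
    patch = find_subplan(sources, failed.takes, failed.gives,
                         unavailable | {failed.name})
    if patch is None:
        return None
    return plan[:i] + patch + plan[i + 1:]


# Illustrative data only: two redundant routes from "gene" to "protein".
a = Source("src_A", "gene", "protein")
b = Source("src_B", "protein", "function")
c = Source("src_C", "gene", "transcript")
d = Source("src_D", "transcript", "protein")
all_sources = [a, b, c, d]

original_plan = [a, b]  # gene -> protein -> function
healed_plan = heal(original_plan, a, all_sources, set())
print([s.name for s in healed_plan])  # ['src_C', 'src_D', 'src_B']
```

In this sketch only the failed step is replaced, which mirrors the "partial new query plan" wording of the abstract. A real system would also have to account for differences in coverage and quality between redundant sources, which this example ignores.
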
Year | DOI | Venue |
---|---|---|
2010 | 10.1109/BIBE.2010.13 | BIBE |
Keywords | Field | DocType |
deep web,remote data source,data redundancy,inaccessible data source,online biological data resides,unexpected data source unavailability,new data source,domain-specific deep web search,data access,biological deep web data,integrated biological deep web,self-healing approach,internet,fault tolerant,redundancy,fault tolerance,biological data | Biological data,Computer science,Server,Unavailability,Data redundancy,Artificial intelligence,The Internet,Query plan,World Wide Web,Exploit,Bioinformatics,Data access,Machine learning | Conference |
Citations | PageRank | References |
1 | 0.36 | 11 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Fan Wang | 1 | 46 | 7.29 |
Gagan Agrawal | 2 | 2058 | 209.59 |