About

What is Bio-vector?

Bio-vector: encode the habitant distribution of species  
A species can be represented by a vector. Species in the same colony or with highly overlaped habitant distribution would have similar vectors.
Say that colony A and B are similar:
  • If a rare insect x appears in colony A, B might also be a potential colony where x shows up.
  • If an endangered insect species x is successfully reproduced in A colony, then B would likely be another feasible colony for species reproduction.
  • If an ecological experiment is held in control group A, then B probably is a suitable treatment group.
  • To measure the similarity between species, in addition to taxonomy and DNA, bio-vector is a new similarity measurement from the perspective of the ecological environment.
  • What is de Bugger?

    de Bugger is a geographic bioinfo emulator empowered by a CNN engine to integrate the rich information including precipitation, vegetation, temperature, etc. provided by NASA Earthdata Search. Based on our novel bio-vectors and environmental conditions, de Bugger can predict the future distribution of different types of insects.

    Furthermore, we believe that the prediction of different species can rely on different environmental information. To address this problem, an attention module is integrated into the CNN which selects important features automatically during training.

    In this way, the attention module masks out all unrelated information and leaves the import feature for further usage.

    How to use it?

    The proposed model, insect2vec, calculates an embedding for each insect by considering the co-occurrence between insects in a real-world environment. The insect2vec (bio-vector) can be applied in many analytical tasks.

    For example, one can easily estimate the habitat distribution of an insect by using the bio-vector integrated with geo-spatial data. Based on this model, one can also predict which insect species could be harsh to the environment in a place and accordingly take actions to avoid predictable ecological disasters.