Currently, I'm working on a scientific project about insects, and I've been keeping a journal of the insects I've found so far. I now understand that listing the names of every insect I have been finding with each observation was a mistake. I am not authorized to share too much information because it is private, but I will add a representative sample of my situation to the table below:
| # of sample |
insect (family) |
| 1 |
Dermestidae, Histeridae |
| 2 |
Histeridae, Dichotumius |
| 3 |
Histeriade |
| 4 |
Dermestidae, Histeridae |
| 5 |
Cleridae, Dichotumius |
| 485 |
Histeriade |
| 486 |
Dermestidae, Histeridae |
| 487 |
Dermestidae, Cleridae |
| 488 |
Histeriade |
Something like the above table. In my actual table, I have cells with 5 or 6 diferent insects. The thing is:
- How can I search for all the different values? I mean, I want to create a table that contains all the different values and how many of them are... Something like the following table:
| Insect (family) |
Count |
| Cleridae |
54 |
| Histeridae |
154 |
| Dermestidae |
34 |
(There are at least 100 different insects and some of them just appear once, so it is impossible for me to search all the different names manually.
Furthermore, I was thinking about converting my table to a long structure. Something like the following;
Instead of this:
| # of sample |
insect (family) |
| 1 |
Dermestidae, Histeridae |
| 2 |
Histeridae, Dichotumius |
| 3 |
Histeriade |
| 4 |
Dermestidae, Histeridae |
| 5 |
Cleridae, Dichotumius |
I want this:
| # of sample |
insect (family) |
| 1 |
Dermestidae |
| 1 |
Histeridae |
| 2 |
Histeridae |
| 2 |
Dichotumius |
| 3 |
Histeriade |
| 4 |
Dermestidae |
| 4 |
Histeridae |
| 5 |
Cleridae |
| 5 |
Dichotumius |
I was thinking that this arrangement should be better than the one that I have now. I hope someone can help me with this issue.