Website and product categorizations
What Is Website Categorization?
Website categorization is the task of classifying a website into one of a set of predefined categories; the set of categories is called a taxonomy. This is usually done with a supervised machine learning model for text classification, because in production one often needs to classify a large number of texts.
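To make the idea of supervised text classification concrete, here is a minimal sketch using a multinomial Naive Bayes classifier written in plain Python. The class name `NaiveBayesCategorizer`, the toy training texts, and the choice of Naive Bayes itself are assumptions made for illustration; production systems typically train much larger models on far more data.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesCategorizer:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing (illustrative)."""

    def fit(self, texts, labels):
        self.doc_counts = Counter(labels)        # number of documents per category
        self.word_counts = defaultdict(Counter)  # word frequencies per category
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def predict(self, text):
        # Words never seen in training carry no information, so drop them.
        tokens = [t for t in tokenize(text) if t in self.vocab]
        total_docs = sum(self.doc_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.doc_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(n_docs / total_docs)
            score += sum(math.log((counts[t] + 1) / denom) for t in tokens)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy training set, invented for illustration only.
train_texts = [
    "summer dresses shirts and leather belts",
    "running shoes jackets and wool scarves",
    "garden furniture planters and patio lighting",
    "kitchen cookware bedding and garden tools",
    "laptops headphones and wireless speakers",
    "smartphones tablets and usb chargers",
]
train_labels = [
    "Apparel & Accessories",
    "Apparel & Accessories",
    "Home & Garden",
    "Home & Garden",
    "Electronics",
    "Electronics",
]

model = NaiveBayesCategorizer().fit(train_texts, train_labels)
prediction = model.predict("shop for wireless headphones and tablets")
print(prediction)
```

In practice the input text would be extracted from the website's pages (titles, headings, body text), and the labels would come from a manually annotated training set.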
Typical website categories
The number of categories in a taxonomy depends on the problem. For example, in an ecommerce setting the top level (Tier 1) of the categorization usually has 21 categories:
| Apparel & Accessories | 226 |
| Home & Garden | 115 |
| Sporting Goods | 50 |
| Health & Beauty | 46 |
| Hardware | 37 |
| Electronics | 30 |
| Animals & Pet Supplies | 25 |
| Office Supplies | 19 |
| Food, Beverages & Tobacco | 13 |
| Toys & Games | 13 |
| Business & Industrial | 10 |
| Baby & Toddler | 6 |
| Luggage & Bags | 6 |
| Arts & Entertainment | 4 |
| Software | 4 |
| Furniture | 4 |
| Religious & Ceremonial | 3 |
| Mature | 2 |
| Cameras & Optics | 2 |
| Media | 1 |
| Vehicles & Parts | 1 |
On the lower tiers, the Google product taxonomy has 190+ categories at Tier 2 and 1000+ categories at Tier 3.
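The tier structure can be illustrated with a short sketch that counts distinct categories at each depth. It assumes categories are written as full paths separated by " > ", the style used in the Google product taxonomy; the five sample paths and the helper names `split_path` and `categories_per_tier` are chosen here for illustration and are not the full taxonomy.

```python
from collections import defaultdict

# Small illustrative sample of category paths (an assumption, not the full taxonomy).
paths = [
    "Apparel & Accessories > Clothing > Dresses",
    "Apparel & Accessories > Clothing > Skirts",
    "Apparel & Accessories > Shoes",
    "Electronics > Audio > Audio Players & Recorders",
    "Home & Garden > Decor",
]


def split_path(path):
    """Split 'A > B > C' into ['A', 'B', 'C']."""
    return [part.strip() for part in path.split(">")]


def categories_per_tier(paths):
    """Return {tier: number of distinct categories at that tier}."""
    tiers = defaultdict(set)
    for path in paths:
        parts = split_path(path)
        for depth in range(1, len(parts) + 1):
            # Key on the full prefix so equal names under different parents stay distinct.
            tiers[depth].add(tuple(parts[:depth]))
    return {depth: len(cats) for depth, cats in sorted(tiers.items())}


counts = categories_per_tier(paths)
print(counts)
```

Run on a complete taxonomy file, the same function would report how many categories each tier contains.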
Website categorization is most often offered as an API or a tool, which makes it easy to integrate into one's own products and services.