Knowledge Architecture & Risk Analysis

Session 4 Deliverable · Ameesha · Vanderbilt Owen MBA Spring 2026 · Due before Session 5

Section 1 — Entity Ontology
The five core entity classes that form the nodes of the RealSkin Knowledge Graph. Every piece of data must belong to one of these classes.
Biological · The User
UserProfile
UserID
SkinType
FitzpatrickScale
PrimaryConcern
SensitivityLevel
AgeRange
HormonalPhase
ClimateZone
Chemical · The Product
ProductProfile
ProductID
Brand
Category
ActiveIngredient
Concentration
InactiveIngredient
pHLevel
Texture
Clinical · The Science
ClinicalNode
DermatologistID
ClinicalStudy
Contraindication
VerifiedRoutine
EfficacyRating
DermSpecialization
ConflictRule
Environmental · The Context
EnvironmentNode
ClimateZone
HumidityLevel
UVIndex
HardWaterIndex
Season
PollutionIndex
ZipCode
Experiential · The Community
ReviewNode
ReviewID
MatchScore
UsageDuration
EfficacyOutcome
AdverseReaction
PurchaseVerified
DisclosureFlag
Section 2 — Predicates & Logic Rules
The edges of the graph. These are the verbs that connect entities and power the Match Algorithm. Every rule here is a constraint on the AI's reasoning.

User ↔ Product Rules

SubjectPredicateObjectRule / Note
User has_sensitivity_to Fragrance IF Product contains_ingredient Fragrance → MatchScore penalty −40%
User has_concern Hyperpigmentation Boost products where treats_concern = Hyperpigmentation by +20%
User lives_in High_Humidity Penalise heavy cream textures; boost gel and water-based textures
User is_in_phase Luteal_Phase Increase sebum_risk flag; recommend BHA over AHA during this window
Product conflicts_with Product Retinol conflicts_with AHA/BHA same-night use → routine safety warning

User ↔ User Rules (The Match Engine)

SubjectPredicateObjectRule / Note
User A has_SkinIQ_similarity User B Weighted: SkinType 25%, Concern 20%, Fitzpatrick 20%, Sensitivity 15%, Climate 10%, Texture 10%
User B reports_success_with Product X IF similarity(A,B) > 85% THEN Product X → recommended_to User A
Review has_usage_duration Under_14_Days Programmatically filtered from efficacy score for actives (Retinol, AHA, BHA, Vitamin C)
Section 3 — Core Knowledge Triples
A sample from the 100-triple domain Knowledge Graph. Each triple is a machine-readable fact: Subject → Predicate → Object. This is how the AI knows what it knows.
001Ameeshahas_SkinTypeCombination-Oily
002Ameeshalives_inHumid_Climate
003Ameeshahas_concernHyperpigmentation
004Ameeshahas_concernHormonal_Acne
005Ameeshahas_sensitivity_toFragrance
006Ameeshauses_productPaula_Choice_2%_BHA
007Ameeshareports_outcomeReduced_Breakouts
008Ameeshalogs_cycle_phaseLuteal_Phase
009Luteal_Phasecorrelates_withIncreased_Sebum
010Increased_SebumexacerbatesClogged_Pores
011Paula_Choice_2%_BHAcontains_activeSalicylic_Acid
012Salicylic_Acidtreats_concernClogged_Pores
013Salicylic_Acidrequires_companionSPF_50
014Salicylic_Acidconflicts_withRetinol_Same_Night
015Retinolrequires_sun_avoidanceTrue
016Retinolrequires_usage_duration_for_efficacy28_Days_Minimum
017Niacinamidetreats_concernHyperpigmentation
018Niacinamidetreats_concernEnlarged_Pores
019Niacinamidecompatible_withRetinol
020Niacinamidesuitable_for_SkinTypeAll_Skin_Types
021Vitamin_Ctreats_concernHyperpigmentation
022Vitamin_Cdegrades_inLight_Exposure
023Vitamin_Crequires_companionSPF_50
024SPF_50critical_for_concernHyperpigmentation
025SPF_50required_if_usingAny_Active_Ingredient
026High_HumidityexacerbatesOiliness
027High_Humiditymakes_unsuitableHeavy_Cream_Texture
028High_Humiditymakes_suitableGel_Moisturizer
029Fitzpatrick_IVhigher_risk_ofPost-Inflammatory_Hyperpigmentation
030Fitzpatrick_IVrequires_higher_prioritySPF_Daily_Use
031Dr_Riya_Mehtaspecialises_inSouth_Asian_Skin
032Dr_Riya_Mehtaverifies_efficacy_ofSalicylic_Acid_for_Luteal_Acne
033Community_Review_Priyahas_SkinIQ_match_toAmeesha: 92%
034Community_Review_Priyareports_success_withPaula_Choice_2%_BHA
035Community_Review_Priyausage_duration6_Weeks
036Review_Under_14_Daysexcluded_fromEfficacy_Score_For_Actives
037CeraVe_Hydrating_Cleansersuitable_forSensitive_Skin
038CeraVe_Hydrating_Cleansercontains_activeCeramides
039CeramidesrepairsSkin_Barrier
040Compromised_Barriercontraindicated_withHigh_Concentration_Retinol
Section 4 — What We Don't Know (Domain Risk Analysis)
A Beaver doesn't just build the dam — they calculate where it will break first. These are the structural blind spots in our Knowledge Graph that must be actively mitigated.
Risk 1 — The Self-Reporting Illusion
High Risk
Assumption being challenged: Users can accurately self-identify their own skin type and concerns.
Users are notoriously poor at self-diagnosing their skin. A user may report "dry skin" when they actually have a compromised skin barrier from over-exfoliation. A user may report "oily skin" when they have dehydrated skin overproducing oil as a compensatory mechanism. If the SkinIQ profile is built on wrong self-reported inputs, every downstream match is built on corrupt data.
✅ Mitigation: Replace diagnostic labels with behavioral questions. Instead of "Do you have oily skin?" ask "How does your skin feel 2 hours after washing with no products applied?" This forces observed behavior over self-diagnosis and dramatically increases input accuracy.
Risk 2 — The Formulation Opacity Gap
High Risk
Assumption being challenged: Ingredient lists are sufficient to predict product efficacy for a given SkinIQ profile.
Two serums both labeled "10% Niacinamide" can behave entirely differently based on their base formulation, delivery system, and inactive ingredient interactions. A Knowledge Graph built purely on active ingredient data will produce false matches. Additionally, brands frequently reformulate products without updating ingredient marketing — so the product a reviewer used 18 months ago may not be the same formulation available today.
✅ Mitigation: Weight the Experiential Graph (community efficacy data) heavier than the Chemical Graph (ingredient lists) in the match algorithm. Real-world outcomes from matched-skin users are more reliable than ingredient-based theoretical predictions. Tag reviews with product batch/year to track formulation drift.
Risk 3 — The Lag-Time Variable
Medium Risk
Assumption being challenged: Reviews will be written after sufficient product usage to reflect true efficacy.
Active ingredients like Retinol, Vitamin C, and BHA require 4–8 weeks of consistent use before meaningful results appear. However, platforms consistently see peak review volume in the first 3–7 days of use — before any active ingredient can possibly work. If early reviews dominate our match scoring, we will systematically mislead users about product efficacy.
✅ Mitigation: All reviews must carry a <UsageDuration> tag. Reviews under 14 days for active ingredients are programmatically excluded from the efficacy score. Reviews under 3 days carry a mandatory disclaimer: "Too early to assess actives." Gamify long-term reviews with GlowPoints bonuses for 8-week+ reviews.
Risk 4 — The Epigenetic & Water Blindspot
Medium Risk
Assumption being challenged: Climate zone (humidity + UV) captures sufficient environmental context.
Tap water hardness varies dramatically across zip codes and directly affects skin barrier function and how cleansers perform. Hard water (high calcium/magnesium) leaves mineral deposits on skin that disrupt pH and exacerbate eczema and acne. We currently track humidity and UV but not water hardness — meaning two users in the same "Humid_Climate" zone may experience products completely differently based on their municipal water supply.
✅ Mitigation: V2 will cross-reference user zip codes with publicly available municipal water hardness databases (EPA Water Quality Reports) to add a <HardWaterIndex> node to the Environmental Graph. This is a zero-user-effort enrichment that dramatically increases match precision.
Risk 5 — The Cold Start Problem
Medium Risk
Assumption being challenged: The match algorithm will be useful from Day 1 with a small user base.
The Skin Match Algorithm only works when there are enough users with similar SkinIQ profiles to generate statistically meaningful matches. For rare skin combinations (e.g., Fitzpatrick VI, dry skin, eczema, cold climate), early users may find no relevant matches — producing the exact frustration we are trying to solve. This risks churning our most underserved users first.
✅ Mitigation: Pre-seed the Knowledge Graph with curated derm-verified content for underrepresented skin types before launch. For any profile where match density is under threshold, fall back to dermatologist-verified clinical recommendations rather than community reviews. Users see "Derm Recommended — building your community match" as a transparent placeholder.
Risk 6 — The Adverse Event Liability Gap
Managed Risk
Assumption being challenged: Community reviews can recommend products without clinical validation and without triggering medical liability.
RealSkin occupies a regulatory grey zone. We are not a medical device, but we are surfacing health-correlated insights (cycle, sleep, stress patterns). If a user follows a skin-matched recommendation, has an adverse reaction, and claims RealSkin's algorithm was responsible, we face reputational and potential legal risk. This is especially acute for Tretinoin (Rx) and prescription-grade actives appearing in routines.
✅ Mitigation: All insights carry a mandatory "Not medical advice" disclosure. Prescription items are flagged with "Rx Required" and never recommended without a verified derm consultation in the platform. Community reviews cannot recommend Rx products. Legal counsel review of all clinical claim language before launch.