From c790672cc244ac4caba1bda3572829a6c6862891 Mon Sep 17 00:00:00 2001
From: Ben Sima <ben@bsima.me>
Date: Sun, 27 Oct 2019 09:48:52 -0700
Subject: move everything to namespace directories

---
 com/simatime/idea/duree-pitch.org | 80 +++++++++++++++++++++++++++++++++++++++
 1 file changed, 80 insertions(+)
 create mode 100644 com/simatime/idea/duree-pitch.org

(limited to 'com/simatime/idea/duree-pitch.org')

diff --git a/com/simatime/idea/duree-pitch.org b/com/simatime/idea/duree-pitch.org
new file mode 100644
index 0000000..d4d9d6f
--- /dev/null
+++ b/com/simatime/idea/duree-pitch.org
@@ -0,0 +1,80 @@
+#+TITLE: Duree: automated universal database
+#+SUBTITLE: seeking pre-seed funding
+#+AUTHOR: Ben Sima <ben@bsima.me>
+#+EMAIL: ben@bsima.me
+#+OPTIONS: H:1 num:nil toc:nil
+#+LATEX_CLASS: article
+#+LATEX_CLASS_OPTIONS:
+#+LATEX_HEADER:
+#+LATEX_HEADER_EXTRA:
+#+LATEX_COMPILER: pdflatex
+#+DATE: \today
+#+startup: beamer
+#+LaTeX_CLASS: beamer
+#+LaTeX_CLASS_OPTIONS: [presentation,smaller]
+Start with this:
+  - https://news.ycombinator.com/item?id=14605
+  - https://news.ycombinator.com/item?id=14754
+Then build AI layers on top.
+* Problem
+Developers spend too much time managing database schemas. Every database
+migration is a risk to the business because of the high possibility of data
+corruption. If the data is modeled incorrectly at the beginning, it requires a
+lot of work (months of developer time) to gut the system and re-architect it.
+* Solution
+- Using machine learning and AI, we automatically detect the schema of your data.
+- Data can be dumped into a noSQL database withouth the developer thinking much
+  about structure, then we infer the structure automatically.
+- We can also generate a library of queries and provide an auto-generated client
+  in the choosen language of our users.
+* Existing solutions
+- Libraries like alembic and migra (Python) make data migrations easier, but
+  don't help you make queries or properly model data.
+- ORMs help with queries but don't give you much insight into the deep structure
+  of your data (you still have to do manual joins) and don't help you properly
+  model data.
+- Graph QL is the closest competitor, but requires manually writing types and
+  knowing about the deep structure of your data. We automate both.
+
+* Unsolved problems
+- Unsure whether to build this on top of existing noSQL databases, or to develop
+  our own data store. Could re-use an existing [[https://en.wikipedia.org/wiki/Category:Database_engines][database engine]] to provide an
+  end-to-end database solution.
+* Key metrics
+- How much time do developers spend dealing with database migrations? What does
+  this cost the business? We can decrease this, decreasing costs.
+- How costly are failed data migrations and backups? We reduce this risk.
+* Unique value proposition
+We can automate the backend data mangling for 90% of software applications.
+* Unfair advantage
+- I have domain expertise, having worked on similar schemaless database problems
+  before.
+- First-mover advantage in this space. Everyone else is focused on making
+  database migrations easier, we want to make them obsolete.
+* Channels
+- Cold calling mongoDB et al users.
+* Customer segments
+- *Early adopters:* users of mongoDB and graphQL who want to spend time writing
+  application code, not managing database schemas. The MVP would be to generate
+  the Graph QL code from their Mongo database automatically.
+- Will expand support to other databases one by one. The tech could be used on
+  any database... or we expand by offering our own data store.
+* Cost structure
+** Fixed costs
+  - Initial development will take about 3 months (~$30k)
+  - Each new database support will take a month or two of development.
+** Variable costs
+  - Initial analysis will be compute-heavy.
+  - Following analyses can be computationally cheap by buildiing off of the
+    existing model.
+  - Customer acquisition could be expensive, will likely hire a small sales
+    team.
+* Revenue streams
+- $100 per month per database analyzed
+  - our hosted service connects to their database directly
+  - includes client libraries via graphQL
+  - may increase this if it turns out we save companies a lot more than $100/mo,
+    which is likely
+- enterprise licenses available for on-prem
+  - allows them to have complete control over their database access
+  - necessary for HIPAA/PCI compliance
-- 
cgit v1.2.3