Video format: Option B — talking head + lower-third text cards (no slide cutaways). Script v3, 357 words ≈ 2:44 at 130 wpm. Deck: 10-page v4, single source for both Alchemist Enterprise B2B Course (deadline 5/31, deck only) and J-StarX SV Extended Program (deadline 6/22, deck + 3-min video). Submit video as an unlisted YouTube/Vimeo link.
357 words · target pace 130 wpm = ~2:44 with ~15s pause headroom under 3:00. Camera stays on you the whole time. The mark tells the editor when to drop in a text overlay (lower-third or full-bleed). 6 cards total over 3 minutes — visual punctuation, not slides. Teleprompter-formatted version: script-teleprompter.html.
Total hands-on time ~2–4h. Record in the 7 sections so any flub is a 20–30s redo, not a full restart.
CARD: … cue, add a text overlay matching what the script specifies:
| Choice | Reason |
|---|---|
| Option B (talking head + text cards) for the video | Per Alchemist insider guides + J-StarX applicant videos: the application screen is about *the founder*, not slide-heavy product explainers. Pure delivery + occasional text accents shows you can sell — the core B2B founder test. (Full slide cutaways are more of a Demo Day style.) |
| 10-page deck (v4) as the single source for both programs | Enterprise B2B Course asks for "approximately 10 pages"; J-StarX has no page minimum. One deck covers both — no version drift, no extra maintenance. |
| Cover line "Before anyone says a word" opens the spoken hook | Echoes the cover slide — burns the message in immediately and pays off again on the closer line. |
| Founder-origin story still anchors Section 1 | The 3-judge panel ranked this highest: the 6-years-on-a-board story earns your right to the pitch before any product claim. |
| "Built into the data itself, not into the screen" | Replaces the v1's "database layer" jargon. Same meaning, plain language. |
| "From an acceleration partner" (not naming a specific program) | Program-neutral phrasing — script + deck reusable across J-StarX, Enterprise B2B, and any other applications. |
| No hard TAM/ARR number spoken | Conservative per investor canon (TAM/SAM/SOM are hedged pre-pilot). The deck carries the figures with "model" qualifier; the video stays defensible. |
| "pre-pilot, no paying customer logos" | Honest traction is canon. Never claims customers/pilots that don't exist (2026-05-28 fabrication lesson). |
| "designed to align with" | Avoids the banned "fully compliant" / "legally safe." |
| "We never judge what your teams say" | Closer is accurate (Kashi never judges on content) and avoids the banned "we do not transcribe" sleight-of-hand. |
Full line→source map + the forbidden-phrase grep log: 00-script-canon-audit.md.
Deck used = Kashi_PitchDeck_v3_Final_2026-05-27.pdf (16 JA slides). The 3-min EN script uses 10 of them as cutaways; 6 are deliberately skipped (reasons at the bottom). Edit in CapCut: drop each PDF page as an image overlay at the listed timestamp; the rest of the time your face is on camera.
| Time | Slide | What you say at that moment (key phrase) |
|---|---|---|
| 0:00 – 0:10 | 01 Cover | "For six years, I sat on the board of a multinational small company in Japan." |
| 0:10 – 0:30 | 02 Problem | "Meeting by meeting, I watched good teams quietly fall apart. Every survey came back without a flag. By the time anyone spoke up, the best people had already quit." |
| 0:30 – 0:45 | 04 気づき | "I'm building Kashi. In Japan, preventing workplace harassment is now a legal obligation — even for small firms, under the MHLW law. But it stays invisible until people leave… every existing tool depends on someone speaking up, or on what they say." |
| 0:45 – 1:00 | 05 Solution | "Kashi doesn't judge what was said. It reads the shape of how a recurring team interacts across many meetings: turn timing, who gets interrupted, whose contributions get no response." |
| 1:00 – 1:13 | 14 Why Now | "Here's the shift. Japan's mandate just expanded to small firms, and the EU AI Act — Article 5 — restricts workplace emotion-recognition. So tools that read content or infer sentiment now face real exposure." |
| 1:13 – 1:25 | 13 差別化 | "Kashi was designed to sit on the other side of that line: deterministic, rule-based detectors, no language model in the judgment path, surfacing patterns over 30-, 90-, and 180-day windows." |
| 1:25 – 1:45 | 06 Product | "The output is a private mirror to the manager themselves, plus aggregate patterns — five people or more — for executives. Content-reading tools can't reach that posture without redesigning their products." |
| 1:45 – 2:05 | 07 見えるもの / 見せないもの | "Four pillars, all built for regulatory defensibility. One — structural, not emotional… Two — self-reflection, not surveillance… Three — bounded visibility is architecture, not policy: five-person minimum, enforced at the database layer. Four — barred by design from any performance, discipline, or termination use." |
| 2:05 – 2:25 | 08 現在地(N=1) | "I'll be straight: we're pre-pilot, with no paying customer logos to share. But the analyzer works — bit-for-bit deterministic, 908 tests… simulated meetings… zero false positives in the no-signal cases. Demo-tenant data runs live in production." |
| 2:25 – 2:55 | 16 Team + Ask | "I'm Justine Acaylar — at Keio, building product, code, and sales solo. Filipino-Japanese: exactly the dialogue this is built for. Kashi isn't surveillance software. It's trust infrastructure… From JETRO and Alchemist, I'm asking for North-American market-fit validation and large-enterprise PoC introductions." |
| 2:55 – 3:00 | 01 Cover (bookend back to wordmark) | "We never judge what your teams say. We help them hear how they say it." |
| Skipped slide | Why not in the 3-min video |
|---|---|
| 03 既存アプローチが見逃すもの | Redundant with 02; the script already lists existing tools verbally ("surveys, hotlines, content classifiers"). Keep it in the deck for the JETRO panel to read in the PDF. |
| 09 6 ヶ月で検証すること | This is the 1stRound-specific 6-month validation plan. JETRO ask is different (NA validation + enterprise PoC intros). Out of scope for the video. |
| 10 初期顧客 | The 3-min script doesn't explicitly state "50–500-person SMB" target. If you want this on screen, swap it in at 0:30–0:45 in place of 04 (your call). |
| 11 市場規模 | Deliberately not flashed on screen. The video stays conservative (canon hedge — TAM/SAM/SOM are "figures undisclosed pre-pilot"). The deck PDF carries the numbers; the spoken video doesn't show them. |
| 12 ビジネスモデル | The video doesn't discuss pricing. Stays in the deck. |
| 15 1stRound が解く 6 ヶ月の壁 | 1stRound-specific — wrong audience for a JETRO submission. Replace with the JETRO ask language (already in the script). |
In CapCut: import the PDF (or 16 PNG screenshots — Preview ⌘P → "Save as PDF" gives you each page). For each row above, drop the matching page as an image layer at the timestamp, sized to fill maybe 70% of the frame on the right while your face stays on the left third. Or: cover full-frame, then cut back to your face for the next sentence. Keep the cover-back transitions short (≤0.3s).
Spec confirmed by the form itself: pitch video = within 3 minutes / English / accessible link. The 2:53 script fits. Below: drafted answers (copy-paste) + the fields only you can fill.
245 words · 0 forbidden-phrase hits · Product + Issues + Target as the form asks. Copy verbatim:
Kashi is a B2B SaaS that helps organizations see dialogue breakdown in their teams before anyone has to speak up — by reading the structure of how a team interacts across many meetings, never the content of what is said.
The problem. In Japan, preventing workplace harassment is now a legal obligation, expanded to small and mid-sized firms under the MHLW パワハラ防止法. But organizational breakdown stays invisible until people quit, because every existing tool — engagement surveys, hotlines, content classifiers, organizational network analysis — depends on someone speaking up, or on what they say. The structural warning signs appear months earlier, in how meetings actually run, and no existing tool reads them.
The product. Kashi converts meeting transcripts into structural features — turn timing, interruption direction, floor-share asymmetry, response latency — and surfaces repeated patterns across 30-, 90-, and 180-day windows using deterministic, rule-based detectors, with no language model in the judgment path. The output is a private mirror to the manager themselves, plus aggregate patterns (five people or more) for executives. Raw content is not retained past a short TTL. The design is built to align with the EU AI Act (Article 5, which restricts workplace emotion-recognition) and is APPI-aware on data retention.
Target. Our initial market is 50-to-500-person, meeting-dependent firms in Japan that already use Zoom, Teams, or Meet with recording enabled, and that carry MHLW harassment-prevention obligations. We are pre-pilot today, with a working analyzer that is bit-for-bit deterministic on transcript input.
Check these 3 (they map exactly to the video's ask):
| Field | Answer |
|---|---|
| 氏名(日本語)/ Name (JA) | アカイラル・ジャスティン (adjust to your registered katakana) |
| 氏名(英語)必須 — include prefix | Mr. Justine Acaylar |
| 役職名(日本語)必須 | 創業者(ファウンダー) |
| 役職名(英語)必須 | Founder |
| Email 必須 / re-enter | ⚠ your email |
| 電話番号 必須 | ⚠ your phone |
| ⚠ your LinkedIn URL | |
| Facebook (optional) | ⚠ optional |
kashi.ai-labo.workers.dev for JA and EN. Fine if the site is bilingual or EN-readable; if it's JA-only, that's acceptable here (the field just needs a working link).