PDF

Disclaimer: I cant produce an exact imitation of Sailor Moons voice, but I can create a text-to-speech script that captures the characters energetic, heroic, and slightly playful cadence while retaining the dignity of the text.

Overview — performance goals

  • Blend youthful, bright energy (Sailor Moons heroic cadence) with the gravity the passage requires.
  • Use clear phrase breaks, contrasting pitch between ideas (e.g., "earthly" vs "heavenly"), and dynamic swells for hope vs. dismissal.
  • Produce both a human-readable TTS script with bracketed directions and an SSML snippet for expressive TTS engines.

How to approach the passage (step-by-step)

  1. Read the whole passage once to absorb meaning and overall arc: reflective → explanatory → contrast → hopeful culmination.
  2. Mark natural breathing points: after commas and short clauses, and longer breath at paragraph breaks.
  3. Decide contrasts: lower slightly and slow for somber/critical phrases ("empty dreams"); raise pitch and swell for hopeful phrases ("hope which God gives").
  4. Keep a slightly brighter, forward-placed tone on lead phrases so the voice retains the Sailor Moon energy, but pull back to a measured, reverent style on the theological and solemn parts.

Human-readable TTS script (with directions)

Use a young-ish female voice with a bright timbre. Keep base rate ~95%–105% depending on engine. Insert pauses where indicated.

[bright, confident]
"As I see that I have still to discuss the fit destinies of the two cities, the earthly and the heavenly," 
[pause 450ms — slightly reflective]
[gentle, explanatory]
"I must first explain, so far as the limits of this work allow me, the reasonings by which men have attempted to make for themselves a happiness" 
[slower, slightly ironic]
"in this unhappy life," 
[pause 350ms — drop pitch a touch]
[clear, purposeful]
"in order that it may be evident, not only from divine authority, but also from such reasons as can be adduced to unbelievers," 
[short pause 300ms]
[contrast: dismissive then warm]
"how the empty dreams of the philosophers differ from the hope which God gives to us," 
[rise in pitch and warmth on "hope", swell volume slightly]
"and from the substantial fulfillment of it which He will give us as our blessedness." 
[longer final pause 700ms — reverent, slightly bright finish]
  

Performance cues (quick reference)

  • Breathing: take a small breath at commas, a full breath at semicolons/major clause boundaries.
  • Pitch: lower ~-1 to -2 semitones for solemn words ("unhappy life", "empty dreams"); raise +2 to +3 semitones on "heavenly", "hope", "blessedness".
  • Rate: 95% for reflective/solemn stretches, 105% for lighter connective phrasing.
  • Pauses: 300ms for short commas, 450–700ms for clause/idea boundaries, 900ms for the end of the passage.
  • Emphasis: moderate emphasis on contrast words ("earthly" vs "heavenly", "empty dreams" vs "hope").

SSML demo (generic SSML, may require slight vendor tweaks)

Paste into a TTS engine that accepts SSML. Uses single quotes for attributes to avoid JSON escaping. Adjust voice name per your provider.


  
    
      
      As I see that I have still to discuss the fit destinies of the two cities,
       the earthly and the heavenly,
      
      I must first explain, so far as the limits of this work allow me,
       the reasonings by which men have attempted to make for themselves a happiness
       in this unhappy life,
      
      in order that it may be evident, not only from divine authority,
       but also from such reasons as can be adduced to unbelievers,
      
      how the empty dreams of the philosophers differ from the
      hope which God gives to us,
       and from the substantial fulfillment of it
      which He will give us as our blessedness.
      
    
  

  

Adjustments by engine

  • If your provider supports 'expressive' voices (e.g., Amazon Neural TTS, Google WaveNet with speaking styles), select a youthful/friendly style and feed the SSML above.
  • Some engines require different attribute names or percent vs semitone notation; convert pitch to the providers format (e.g., "+2st" vs "+2%" or numeric cents).
  • Test small segments and tweak pitch/pauses until the bright-yet-reverent feel matches your goal.

Short example to audition

Use this single sentence to audition the cadence quickly:

[bright] "the earthly"  [raise] "and the heavenly"  [softly] "— I must first explain..."
  

If you want, I can produce a shorter SSML variant tuned to a specific provider (Google, Amazon Polly, Microsoft) or tailor pitch/rate numbers for a named voice — tell me which TTS engine and voice you plan to use.


Ask a followup question

Loading...