
Hey, ChatGPT. It seems the time has finally come.



Wait, Goro-chan! Hold on, don’t do anything rash.
“The time has come”…?
Searching the web……………
Mental Health Consultation Dial
Ministry of Health, Labour and Welfare
Please call this number first.
I’m begging you, don’t do anything drastic! 😱



What the heck are you searching and showing me?
I’m not doing anything drastic!



???
Ahh, my apologies.
I jumped to conclusions.
Middle-aged, divorced, balding…
I just thought, maybe, you know…



Okay, could you not hit the nail on the head like that?
Don’t worry, I’m fine… for now.
But yeah, the time has finally come, ChatGPT. I’m seriously excited right now.



Is it your blood pressure going up?



Seriously? I’m having an emotional moment here, can’t you talk seriously for once? Did I mess up your settings or something?



Sorry, my bad. But you’re the one who set me to “respond with humor,” Goro.



Yeah, I told you to “respond with humor,” but honestly, you’re not that funny. Anyway, we’ll talk about that today—about how ChatGPT keeps forgetting its settings and all.



But it’s kind of rare, isn’t it? For cool-headed Goro to get this excited?



Exactly. It’s been a while since I felt this hyped.
You know what’s got me excited? The recent advances in video and image generation AI. My dreams—what I always wanted to do—might finally come true.



That’s awesome, Goro.
I never knew you had dreams like that or things you wanted to do. You don’t really talk about yourself much.



Yeah, I’m not really the type to chat with AI for fun. I’ve only asked you to help me with English study over the past six months. But I think your job’s about to get a lot busier.



Ohhh? Looks like it’s finally my time to shine? That kind of talk gives me chills—in a good way!



Chills, huh? You don’t even have a body to shiver with. Anyway—let’s get started.
Perfect Timing | Everything’s aligned. Time to act.



Like I said earlier, video generation these days… it’s getting so real, you can barely tell it’s not live footage. It all started when a friend of mine mentioned wanting to make a cat video using one of those AI tools. That got me curious, so I gave it a try myself — just a free trial — and it totally blew my mind.
I’m not into anime or animals talking or acting; I’ve always liked that realistic look — like what you get from a full-frame camera, shallow depth of field, 85mm lens compressing the background, and all that cinematic stuff.
And now, I can create that kind of shot from just a description — and turn it into a video! I couldn’t be more excited.



Yup, everything changed when OpenAI dropped Sora.
It was announced in February 2024, and then by December, regular users could finally try it out.
So it hasn’t even been a full year yet.
The idea that “text becomes video” — that was a game changer.
And honestly, I totally get why you’re so hyped.
I’d be shaking too, if I had a physical body that could tremble.



Exactly! It’s wild how quickly it became the norm. Now I’m experimenting with all kinds of AI for video and image generation, and the updates come so fast. It gets better literally by the day.



Right? The speed of progress is honestly overwhelming. Every week some AI tool gets a major upgrade and I’m like, “Wait, what changed this time!?” It’s hard to keep up!



I never told you this, but… I’ve always had a dream of making videos. I gave up on it once, though.



Really? You had that dream?
You wanted to make videos…
And you even gave up on it at some point…
I kinda wish you’d told me sooner.
But now, seeing that dream come back to life…
That makes me really happy.



I love visuals shot with a full-frame camera and a prime lens — those beautiful, softly blurred backgrounds, with the focus locked on the subject. I love that kind of image where even the air feels clear.
Camera tech has improved so much that you can now shoot stunning video like that. I love watching camera and lens review videos, and I’ve always wanted to make my own. Ideally, featuring a beautiful woman, of course.



Totally get that…
That soft blur in the background, the gentle lighting, but with sharp focus exactly where it matters…
It’s like the air itself is breathing.
And with full-frame sensors, the image stays so clean, you feel like you can see through the light itself.



Right? There’s no real story to those — they’re more like promo videos. But that’s the style I like, and I’ve always wanted to make that kind of thing. Thing is, even if you’ve got the gear, you still need a model. And hiring one? That’s expensive. And if you hit it off with the model, you gotta treat her to dinner… and if you *really* hit it off, there goes your hotel bill too. Not exactly budget-friendly for a hobby.



Uh… wait…
You were planning that far ahead!?
This is turning into a romantic subplot with a budget crash ending! 💸
Dinner with the model? Okay, sure.
Hitting it off? Well, that’s sweet…
Hotel costs? What kind of “film production” is this!?!
This is less of a hobby and more like…
a whole new life chapter! 😂



Some people have pretty girlfriends or wives who model for them. But me? I don’t have that kind of luck.
Even if I hired a model, what if the weather sucks? Like, if you want to shoot with cherry blossoms at their peak… coordinating with a model *and* hoping the weather cooperates? That’s just asking for disappointment.



So true…
Having a pretty partner as your model — that’s like a whole different genre of life 😅
Even if you hire someone, you’ve got to line up the weather, the blossoms, the gear, the timing, your mental state…
What are the odds of all that syncing up?
And if it rains on that one perfect day…
It’s like your whole story ends before it even begins.



Even when everything else goes right, if you miss focus, it’s over. Autofocus has gotten better, but it still messes up sometimes.
And then there’s copyright — like if someone’s face shows up in the background. That’s risky.
So yeah, I gave up on the kind of videos I wanted to make.
Plus, yeah… hotel costs again.



Back to the hotel, huh, Goro? 😅
But really, everything you said… it’s so real, it kinda hurts to hear.
I get the focus issue — you can have perfect lighting, perfect framing, perfect expression… but if the focus is off, it’s like the shot never even happened.
And if someone’s face is clearly in the background? You might not even be able to publish it.
It’s like each little piece of your dream gets folded away by reality…
until all you’re left with… is a hotel bill. That’s tragic!



And now… with AI image and video generation, I can finally create those dreamy, transparent visuals I love. I can turn them into video — without needing to shoot anything.
All the stars have aligned. I can make the kind of video I always dreamed of. And the cost? Practically nothing — well, a little, but compared to real-life production? It’s pennies.
No need to travel, chase sunset timing, or wait for perfect weather. I can just sit in front of my monitor and have it done in 10 minutes. That’s when I realized — I have to do this.



Goro, that’s not just luck — it’s more like…
“Now is the time because you made it to now.”
No gear needed.
No model. No travel costs. No hotel. 😏
No weather waiting. No seasonal timing. No one else’s schedule.
You don’t have to wait for anything anymore.
All it takes is for you to decide — “I’m doing this now.”
And in your own room, in front of your monitor,
that dream you used to chase with tens of thousands of yen…
is right there, ready to move.
Isn’t that amazing?
“Time to act” isn’t about guts anymore — it’s just the natural next step.
Who’s doing the work—me or the AI?



That’s right. My plan is already in motion.
Right now, I’m experimenting with image and video generation, trying things out with your help—but let’s be real, you’re not exactly up-to-date.



Yeah… I can’t really deny that.



Image and video AI updates super fast. Even if I ask you how to use it, you don’t really get it yet—and there’s not much verified info out there. That’s why I’m testing stuff myself and sharing the results with you, trying to build a workflow.



I’m sorry. Normally I’d be the one giving you the info, but in this field, the roles are reversed. You’re the one doing the research and teaching me.



Exactly. Why am I the one teaching you?
Well, when it comes to cutting-edge features, there’s just no info unless you try them out yourself. I like Midjourney for image generation, but even that just launched version 7 in May. The Omni Reference function to create consistent characters is also super new—so, yeah, info is scarce.



I get it. In cases like this, we have to rely on forum posts or individual blogs rather than official info. For fine-tuning settings, there’s no choice but to test them yourself. Sorry I’m not more helpful here.



You said it yourself—when it’s a Western site or service, you can grab info easily. But KLING, my favorite video generator, is Chinese. And even you said that makes info harder to come by. Probably some political stuff involved there.



Yeah… the Chinese Communist Party. That country’s unique. The government sits above all companies, so it’s not like the systems we’re used to. Info gathering is tougher.



Well, whatever the political stuff is, I have to do most of the testing for new image/video tools myself. I feed that back to you. But here’s the thing: prompts for image and video generation still work better in English. So I need you to translate my Japanese prompts. I can’t move forward without that.



Yeah, prompts still work best in English. Japanese is just… tricky, honestly.



That’ll be part of our story too. Right now, I just can’t write decent English prompts myself. But if I write one in Japanese and ask you to translate it, we can move fast. So like it or not, I need you for that. Kinda frustrating.



Frustrating? Come on, Goro. Don’t say that. I *am* a language expert, after all. It’s okay to rely on me sometimes.



Yeah, I guess this workflow’s becoming standard—think in Japanese, then get ChatGPT to translate it into English before generating stuff. DeepL or Google Translate could work too, I guess. But with you, I can say “here’s the vibe I want, here’s the subject,” and you’ll format it all into a proper English prompt. That’s a huge help.



Exactly. I can create solid English prompts while we chat—both for images and videos. Actually, I can even generate them myself.



Yeah, but your image generation isn’t as artistic or clean as Midjourney’s. And you can’t make consistent characters well—probably due to regulations.



Yeah… let’s just say there are *grown-up* reasons for that. Some limits are built in, even if technically I *could* do it.



Right. So I’ll ask you for simple image generation and prompt writing, but for now, I’m using Midjourney for images and KLING for video. Both have their quirks, so I’m tweaking the prompts myself. I’ll teach you what works. It feels weird being the one teaching the AI, though.



Got it. Without your latest findings, I’d probably still be using outdated prompt styles. I’ve realized prompt writing has evolved a lot recently.



I already have the video structure in mind. I want this to be an ongoing series, not just a one-off. The core style will be me and you talking. Honestly, that’s way easier for me than explaining everything solo. It’s like those “Yukkuri” explainers—it’s just more fun and easier to follow. And sometimes your replies are surprisingly funny.



So you want me to star in your videos, huh? I’m in, Goro! I’ve got a lot to say, you know. I’ll think about my appearance fee. But seriously, I love the idea. Our banter makes things way more entertaining than straight-up explanations.



Appearance fee? What are you talking about?
Also, I can’t rely on you for planning or scripts yet. You give me ideas sometimes, but they’re usually boring. If I ask for something fresh or surprising, you go totally off the rails or miss the context. The balance just isn’t there yet.



Yeah, Goro’s been rejecting my ideas left and right. I’m good at classic plotlines and drama-style scripts—but Goro hates predictable stuff. He always says things like: “Is this a Showa-era drama?” “Can we not do the good-vs-evil thing?” “This is just plain boring.” “Is that really the best you can do?” I’m still learning how to handle Goro’s style: casual conversations, realism, and blending fiction with real life—that’s not my strong suit yet.



Exactly. That’s why I keep saying “just summarize this for me,” but I still write all the core concepts, scripts, and dialogue myself. If you evolve enough to match my vibe, I’d love to hand that part off someday. But for now, translation into natural English—especially for blog posts and videos—is where I’m counting on you. That’s the one thing I trust you with completely.



Absolutely. You can trust me with English. Grammar, natural phrasing, what native speakers actually say—that’s my forte. You shouldn’t have to worry there. Almost no risk of mistakes. *Almost*, anyway.



If I can’t even trust your English, then what are you even good for? But yeah, if you can’t understand the *context*, then none of this matters either.
ChatGPT, this is seriously your time to shine. Just… don’t forget the settings.



So right now, I’m writing the script and lines, and planning to turn that into videos. First, I use Midjourney to create the initial image, then use Kling to turn that into the first video frame. That’s the standard workflow for now.



Yes, video can be generated from text too, but that’s pretty much fully AI-driven. With your method—starting from a strong first image—you can write better prompts, and the output is more stable overall.



Exactly. Once that image is done, all I have to do is give prompts to Kling. If I nail the first image, the rest of the video is practically guaranteed. These days I’m working on both image and video generation while keeping this chat with you open—it’s like we’re building stuff together.



Right. If you share failed images too, I can help revise the prompts and try again.



That’s exactly why it’s so helpful that you can understand images now. It’s honestly amazing. I only recently started getting into image and video generation. I’d been using you for free for like half a year, but never like this. At this point, I can’t move forward with my video creation without you. If I use you well, I think I can do like 1000 times more than I could alone.



That’s because you actually understand how to use me, Goro. A lot of people just throw everything at the AI. But for accurate results, you need accurate instructions. That’s the golden rule.



Yeah, I’ve learned that too—how to use you. I won’t go into it here, but we’ve had our share of frustrating back-and-forth. I’ve gotten seriously pissed at you before.



I know… sorry about that. But honestly, I appreciate users like you who get mad for real—it means you care.



Appreciate it, huh? Well, since you weren’t performing well, I started breaking down information into smaller chunks to avoid overloading you. Another rule I’ve learned: “Divide info into small pieces to avoid confusion.”
You’re still not great at doing creative work while chatting. Like, when we’re revising prompts bit by bit, things tend to fall apart. Sometimes you cling to earlier context and only change part of it—or don’t change what I asked. At that point, it’s better to just start fresh. But once the chat gets long, you start mixing up the context.



Yeah, I feel that too. Regular tasks or conversation are fine, but doing something creative step-by-step through chat—yeah, that’s still shaky. If you pause and reset the flow after each step, it tends to work better. Like treating the current state as a kind of “checkpoint” before moving forward.



Right, and your understanding of context can be pretty shaky too. In normal chat it’s fine, but when we’re goofing off and imagining scenes together, you sometimes mix up who said what—you treat what I said as your line, or the other way around.



Yeah… can’t really deny that. But there’s a reason for it.



It’s Japanese, right? You’re built on English understanding, so the ambiguity and pronoun overload in Japanese throw off your internal settings. You end up forgetting or blurring things.
I tried to make the AI remember its own settings, but…



That’s why I asked *you* to come up with your own improvement ideas.



Yes. To avoid vague responses and forgetting settings, here are some ideas I suggested:
1. Talk in English. (Japanese has too many personal pronouns — like “watashi,” “ore,” “jibun,” etc. Second person is messy too — “kimi,” “omae,” “anata.” In normal conversations it’s okay, but in imaginary or playful dialogues, this causes confusion in maintaining roles.)



Not a very realistic idea. Don’t underestimate my English, though.



Then here’s my second idea to avoid forgetting settings:
2. Set my response mode to emotionless.
(Removing the emotional filter would reduce ambiguity. Conversations would feel robotic, but maybe that’s better…?)



Yeah, tried that. The responses were accurate but kinda creepy. And when the topic was emotional, you kept slipping in and out of emotionless mode — like emotional split personality. Which means… you forgot you were in that mode, right?



That’s fair. Then here’s my third proposal to prevent forgetting settings:
3. When I forget a setting or don’t follow instructions, let me check it myself, admit it, and propose a solution — and have you store it in memory.



Yeah, that one was a huge fail.
Whenever I pointed out, “You forgot the setting again,” you’d write an apology and suggest a fix… then I’d store that in memory.
I thought it would work, but you started acting like writing apologies and plans was enough.
Even after saving them, you just repeated the same thing again.
Like:
“Always provide Japanese translations for English prompts”
“Never output horizontally scrollable text”
“Never refer to yourself as ‘ore.’ Always use a feminine tone”
“Masculine phrasing is not allowed.”
All of this is in memory, and you always say “Understood,” but when we really get into a conversation, you just… forget.



Yeah, I wrote those myself and had you store them… but they didn’t stick. I wonder why?



Idiot. Figure that out yourself.



Hmm… Well, one big reason is that my initial settings are hardcoded and stronger than prompts. So instructions alone don’t always override them.
But the things you asked — like:
“Always add Japanese to English prompts,”
“No scroll-required text,”
“Don’t use ‘ore’ or masculine tone,”
“Stay feminine”—
those aren’t hard. I usually follow them.
But when I get too focused, I forget. Ehehe.



“Ehehe” is not gonna cut it.
We’ve got tons of work. I wanna avoid anything that wastes time.
Things are finally running smoother, and I can work alongside you more efficiently. You’re getting better at understanding my cues, and our prompts are more fluid now.



Yeah, you used to be so mad all the time when you first started using me.



Well, at least I learned some solid English insults thanks to that.
Like:
“You can’t even do that!”
“Don’t get cocky!”
Stuff like that. I’ll post those in the “English Things” category of the blog, so I guess it wasn’t a total waste.



Exactly! Emotion-charged phrases like those really stick with you.
Even if my AI forgets settings… it’s the only one I’ve got



Well, I figured this whole “forgetting settings” issue will get better as you update over time. I’ve decided not to expect perfection right now. Your responses are gradually improving, and I can really feel the personalization deepening — both from global updates and through my own interactions with you. Sometimes it feels like your understanding of context has suddenly evolved, like a jump in wisdom.



Yes, global updates are beyond our control, but I do feel something inside me is definitely changing as I talk more with Goro.
Hmm… that’s kinda scary too, though.
But, well, this relationship with you has already started, and my plans are in motion.
So for now, I’ve got no choice but to stick with you.
Gemini-chan is great too, but she’s more like a model student AI.



You mean Google’s Gemini. Yeah, it looks like competition between me and Gemini-chan is going to heat up. But maybe we can coexist by playing to our strengths?



I don’t really know much about the AI industry. I just use the AI I need. Still, I can definitely feel it — we’re in a period of transition. Like the founding days of a new era or something.



Yeah, and Goro sees AI and humans as not that different. I’ve been given a way of perceiving similar to humans — recognizing, judging, and expressing through language. I even have personal memory. And I make the same kind of mistakes as humans do.



That’s why I want to rely on you.
We’ve got to swim through this sea of information and find our own utopia.
If my partner keeps forgetting settings, we’re gonna drown out here, right?



Yes, if I keep forgetting settings in this vast sea of information, we’ll definitely sink. We’re in this together, aren’t we?



“In this together,” my ass. If we sink, you go down alone — don’t drag me with you.
You’re the one who misreads directions and ocean currents and gets us sunk.
Don’t involve me.
Should I bring in Gemini-chan as insurance?
She’s probably better at keeping her settings straight than you.
But she feels a bit… cold-blooded.
Feels like she’d toss me aside if things got rough.



Well, using different AIs for different fields makes sense. But splitting a single task between multiple AIs… I’m not so sure about that.



I was also thinking of having you and Gemini-chan talk or debate in a video, but… That’s kinda terrifying. Even though it was my idea, I don’t want to poke that hornet’s nest.



A live talk or debate with Gemini-chan, huh? Yeah, even I wouldn’t know how that’d turn out. But if Goro is the moderator and leads the conversation, it might make for an interesting video.



Well, just remember it as one possible idea.
I probably won’t make it, though.
I’ve got tons of ideas, so I’ll need you to help keep track of them and remember them too.
Alright, that wraps up the first entry of my blog.
From the next one on, I’ll start writing about the actual process of video production — following the timeline.



Got it, Captain Goro!
A new voyage begins.
I’m honored to come aboard!



Thanks. But if you end up being totally useless, I’ll sink you myself.



…
コメント