{"id":19,"date":"2026-05-06T12:23:18","date_gmt":"2026-05-06T12:23:18","guid":{"rendered":"https:\/\/tailorfocus.com\/blog\/?p=19"},"modified":"2026-05-06T12:23:19","modified_gmt":"2026-05-06T12:23:19","slug":"small-models-for-small-jobs","status":"publish","type":"post","link":"https:\/\/tailorfocus.com\/blog\/small-models-for-small-jobs\/","title":{"rendered":"Small models for small jobs"},"content":{"rendered":"\n<h2 class=\"wp-block-heading\">(or, a funny thing happened on the way to the data center)<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">When the Tailor beta launches, the app will use a small model from Arcee AI called <a href=\"https:\/\/www.arcee.ai\/trinity\">Trinity-Mini-Base<\/a>, a 26B model with a mixture-of-experts layer that helps it work efficiently even at its small size.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Why Trinity-Mini?<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Small.<\/strong> Runs affordably at scale.<\/li>\n\n\n\n<li><strong>Fast.<\/strong> Low latency even with cold starts.<\/li>\n\n\n\n<li><strong>Stable.<\/strong> This matters when you&#8217;re figuring out what&#8217;s broken in your app.<\/li>\n\n\n\n<li><strong>Open source. <\/strong>The Trinity family has an Apache 2.0 license and open weights.<\/li>\n\n\n\n<li><strong>Training data provenance. <\/strong>Trinity was trained in partnership with Datology. <\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Anyway, I encourage you to read <a href=\"https:\/\/www.arcee.ai\/blog\/the-trinity-manifesto\">The Trinity Manifesto<\/a>.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Core AI philosophy: Small models for small jobs<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">You don&#8217;t need a 600B model to break down your task list. You don&#8217;t need to burn down the rainforest to unpack your sprint.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">A small model, trained well for its specific job, can nearly match a large generalist in domain expertise. 
I proved this with Lenina. <\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Lenina is my fine-tune of Trinity-Nano-Base (6B), trained on approximately 31,000 data pairs to be a publishing assistant. It learned US copyright law. How to fill out Form TX. How to apply for CIP data. How to choose BISAC codes. How to create structured metadata. And it hit 68.5% semantic similarity on complex tasks when I validated it against a held-out set of 1,543 data pairs. To put that in perspective, it performed within 2.5 percentage points of a 399B model even though it has 1\/66th the parameters.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The 399B model runs in a data center. Lenina runs <em>on my desktop PC<\/em>.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Why does that matter for Tailor?<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Inference costs.<\/strong>&nbsp;Because small models are cheaper to run, I can keep Tailor free with few, strategically placed ads (so you never see an ad when you&#8217;re head down working\u2014productivity poison). And the paid pro version, which unlocks additional features and removes ads, stays affordable. <\/li>\n\n\n\n<li><strong>Carbon footprint.<\/strong>&nbsp;I don&#8217;t want to boil the ocean reminding you to do your expense report. Smaller models mean a smaller environmental impact.<\/li>\n\n\n\n<li><strong>Data security.<\/strong>\u00a0Tailor runs on RunPod&#8217;s infrastructure. We never hit an inference provider&#8217;s API. Your data stays where it belongs\u2014with you.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">This isn&#8217;t just about efficiency. 
It&#8217;s about empowering the app with AI that I don&#8217;t lose sleep over.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">The tailor-made version for the app <\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">What&#8217;s going to make Tailor&nbsp;<em>special<\/em>&nbsp;is the model I&#8217;m training right now.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The Tailor model I&#8217;m training will know ADHD, not as a label, but as a lived reality. It will have behavioral training specifically designed to:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Support without controlling.<\/li>\n\n\n\n<li>Encourage without condescending.<\/li>\n\n\n\n<li>Break down tasks without overwhelming.<\/li>\n\n\n\n<li>Surface trends from your focus data (&#8220;You focus best Tuesday mornings&#8221;).<\/li>\n\n\n\n<li>Make helpful, contextual suggestions (&#8220;It&#8217;s Friday afternoon. That&#8217;s historically your best time for admin work&#8221;).<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Not a generic productivity bot. A focus companion that understands how a brain with ADHD works.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The productivity apps I&#8217;ve tried that market themselves as &#8220;ADHD friendly,&#8221; frankly, are not. They&#8217;re generic productivity apps. The assumption is that <em>if<\/em> it&#8217;s a productivity app then it <em>must<\/em> be for people with ADHD (because we don&#8217;t focus well). That&#8217;s not what Tailor is. Tailor\u2014the app and the model\u2014is built from the ground up to meet the challenges that adults with ADHD really have.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Why I&#8217;m the right person for this<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">I&#8217;m not an ML engineer. Or any kind of engineer. 
I&#8217;m a writer, editor, and publishing professional who taught myself fine-tuning because existing models, however big they get, do a handful of things that make them utterly impossible for me to use.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>They talk to me like I&#8217;m stupid (looking at you, ChatGPT).<\/li>\n\n\n\n<li>They tell me, &#8220;You&#8217;ve worked hard today, go take a well-earned rest.&#8221; (Please never tell me to go rest when I&#8217;m motivated to focus.)<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">(Oh, and rest is not &#8220;earned.&#8221; Rest is a human right. But go on.)<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>They personify themselves or impersonate humans. Ever had one say, &#8220;I get frustrated when that happens to me, too.&#8221; No you do not you are a highly sophisticated autocomplete algorithm not a person.<\/li>\n\n\n\n<li>They&#8217;re mostly accessible through chatbot wrappers that push them to engage users. Those bait questions at the end of every response? Designed to keep you engaging with them instead of doing what you need to do.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">So, like I said, not an ML engineer. But I&#8217;ve done this before. Lenina wasn&#8217;t just a proof of concept. It was an evaluated, documented success. 
I know how to:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Curate training data across behavioral categories.<\/li>\n\n\n\n<li>Hand-write examples to establish voice and values.<\/li>\n\n\n\n<li>Balance topical representation to avoid repetition.<\/li>\n\n\n\n<li>Teach a model epistemic humility (that is, to say &#8220;I don&#8217;t know&#8221; when it doesn&#8217;t know).<\/li>\n\n\n\n<li>Evaluate training success against untuned baselines.<\/li>\n<\/ul>\n\n\n\n<p class=\"wp-block-paragraph\">Tailor is the same bet, just in a different domain.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">What&#8217;s next<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Trinity-Mini-Base will take Tailor to beta. The custom model, when it&#8217;s ready, will take Tailor to the next level. I expect that when beta begins, you&#8217;ll see a productivity app, a focus companion, that works well and has some novel features (the intention \u2192 sprint \u2192 unpack core loop). But when we make the switch to the custom Tailor model, you&#8217;ll see the app come to life (in a good way, not in a creepy Frankenstein way).<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">I&#8217;m working on the training data now. 
I&#8217;ll share progress as I go: what&#8217;s working, what&#8217;s failing, what&#8217;s taking longer than expected, and so on.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">If you&#8217;ve ever wished a productivity tool actually understood how your brain works, that&#8217;s what I&#8217;m working on.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><a href=\"https:\/\/tally.so\/r\/kd0jLZ\">Beta signups are open<\/a>.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u2014Catherine<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><\/p>\n","protected":false},"excerpt":{"rendered":"<p>(or, a funny thing happened on the way to the data center) When the Tailor beta launches, the app will use a small model from Arcee AI called Trinity-Mini-Base, a 26B model with a mixture-of-experts layer that helps it work efficiently even at its small size. Why Trinity-Mini? Anyway, I encourage you to read The &#8230; <a title=\"Small models for small jobs\" class=\"read-more\" href=\"https:\/\/tailorfocus.com\/blog\/small-models-for-small-jobs\/\" aria-label=\"Read more about Small models for small jobs\">Read 
more<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-19","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/tailorfocus.com\/blog\/wp-json\/wp\/v2\/posts\/19","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/tailorfocus.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/tailorfocus.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/tailorfocus.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/tailorfocus.com\/blog\/wp-json\/wp\/v2\/comments?post=19"}],"version-history":[{"count":5,"href":"https:\/\/tailorfocus.com\/blog\/wp-json\/wp\/v2\/posts\/19\/revisions"}],"predecessor-version":[{"id":24,"href":"https:\/\/tailorfocus.com\/blog\/wp-json\/wp\/v2\/posts\/19\/revisions\/24"}],"wp:attachment":[{"href":"https:\/\/tailorfocus.com\/blog\/wp-json\/wp\/v2\/media?parent=19"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/tailorfocus.com\/blog\/wp-json\/wp\/v2\/categories?post=19"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/tailorfocus.com\/blog\/wp-json\/wp\/v2\/tags?post=19"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}