URL-encoding URLs in AppleScript

The AppleScript Safari API is apparently quite finicky and rejects Russian Cyrillic characters when loading URLs.

For example, the following URL https://en.wiktionary.org/wiki/стоять#Russian throws an error in AppleScript. Instead, Safari requires URL’s of the form https://en.wiktionary.org/wiki/%D1%81%D1%82%D0%BE%D1%8F%D1%82%D1%8C#Russian whereas Chrome happily consumes whatever comes along. So, we just need to encode the URL thusly:

use framework "Foundation"

-- encode Cyrillic test as "%D0" type strings
on urlEncode(input)
   tell current application's NSString to set rawUrl to stringWithString_(input)
   -- 4 is NSUTF8StringEncoding
   set theEncodedURL to rawUrl's stringByAddingPercentEscapesUsingEncoding:4 
   return theEncodedURL as Unicode text
end urlEncode

When researching Russian words for vocabulary study, I use the URL encoding handler to load the appropriate words into several reference sites in sequential Safari tabs.

--
-- Created by: Alan Duncan
-- Created on: 2017-11-01
--
-- Copyright (c) 2017 Ojisan Seiuchi
-- All Rights Reserved
--

use AppleScript version "2.4" -- Yosemite (10.10) or later
use scripting additions
use framework "Foundation"

set searchTerm to the clipboard as text
set openRussianURL to "https://en.openrussian.org/ru/" & searchTerm
set wiktionaryURL to "https://en.wiktionary.org/wiki/" & searchTerm & "#Russian"
set forvoURL to "https://forvo.com/search/" & searchTerm & "/ru/"
set ruWiktionaryURL to "https://ru.wiktionary.org/wiki/" & searchTerm

tell application "Safari" to activate

-- load word definitions
tell application "Safari"
   activate
   set i to 0
   set tabList to every tab of window 1
   set tabCount to count of tabList
   repeat tabCount times
      tell window 1
         set i to i + 1
         set textURL to (URL of tab i) as text
         -- load the word in open russian
         if textURL begins with "https://en.openrussian.org" then
            set encodedURL to urlEncode(openRussianURL) of me
            
            set URL of tab i to encodedURL
            
         end if
         -- load the word in wiktionary
         if textURL begins with "https://en.wiktionary.org" then
            set URL of tab i to urlEncode(wiktionaryURL) of me
            -- make the wiktionary tab the active tab
            try
               set current tab of window 1 to tab i
            end try
            
         end if
         
         if textURL begins with "https://forvo.com" then
            set URL of tab i to urlEncode(forvoURL) of me
         end if
         
         if textURL begins with "https://ru.wiktionary.org" then
            set URL of tab i to urlEncode(ruWiktionaryURL) of me
         end if
      end tell
   end repeat
end tell

-- encode Cyrillic test as "%D0" type strings
on urlEncode(input)
   tell current application's NSString to set rawUrl to stringWithString_(input)
   set theEncodedURL to rawUrl's stringByAddingPercentEscapesUsingEncoding:4 -- 4 is NSUTF8StringEncoding
   return theEncodedURL as Unicode text
end urlEncode

Consume media outside one's bubble?

That “reality bubbles” contribute heavily to increasing political polarization is well-known. Customized media diets at scale and social media feeds that are tailored to individual proclivities progressively narrow our understanding of perspectives other than our own. Yet, the cures are difficult and uncertain. Often, though, we’re advised to consume media from the other side of the political divide.

A sentence from a recent piece in The Atlantic encapsulates why I think this is such a fraught idea:

But somehow, 80 percent of Republican voters said they believe that the virus is at least “somewhat under control” in the same week that cases reached record numbers.

James Hamblin How Trump Sold His Pandemic Failures to Voters (The Atlantic 2020-11-10)

Here’s the problem. If the media consumed by 80% of Republican voters is so careless with its delineation between facts and politically-motivated discourse, why would someone recommend it to a progressive trying to learn something about the opposite ideological camp? Of what use is it to me to consume media that carelessly or maliciously handles facts? In asking progressives to consume conservative media like Fox News and their ilk, you are setting up an equivalency with centrist and neutral media outlets - an equivalency that does not exist.

References

Свидетельство того или тому?

I was puzzled by this sentence on the BBC Russian Service:

Нет свидетельств тому, что на нынешних выборах дело обстоит иначе.

ББС
  <cite>Мошенничество на выборах в США? Проверяем факты в речи Трампа</cite>

It means “There is no evidence that in the current election things are any different.” but the puzzle isn’t the meaning, it’s the grammatical case in which the author has placed the demonstrative pronoun то , which is dative here тому . The thing is that you see examples where either the genitive or the dative follows свидетельство . So what’s the difference?

Escaping "Anki hell" by direct manipulation of the Anki sqlite3 database

There’s a phenomenon that verteran Anki users are familiar with - the so-called “Anki hell” or “ease hell.”

Origins of ease hell

The descent into ease hell has to do with the way Anki handles correct and incorrect answers when it presents cards for review. Ease is a numerical score associated with every card in the database and represents a valuation of the difficulty of the card. By default, when cards graduate from the learning phase, an ease of 250% is applied to the card. If you continue to get the card correct, then the ease remains at 250% in perpetuity. As you see the card at its increasing intervals, the ease will remain the same. All good. Sort of.

Typing Russian stress marks on macOS

While Russian text intended for native speakers doesn’t show accented vowel characters to point out the syllabic stress (ударение) , many texts intended for learners often do have these marks. But how to apply these marks when typing?

Typically, for Latin keyboards on macOS, you can hold down the key (like long-press on iOS) and a popup dialog will show you options for that character. But in the standard Russian phonetic keyboard it doesn’t work. Hold down the e key and you’ll get the option for the letter ë (yes, it’s regarded as a separate letter in Russian - the essential but misbegotten ë .)

Stripping surveillance parameters from Facebook and Google links

While largely opaque to most users, Facebook and Google massage any links that you acquire on their sites to include data used to track you around the web. This script attempts to strip these surveillance parameters from the URL’s. It is by no means all-inclusive. Imaginably, there are links that I haven’t yet encountered and that need to be considered in a future version. So consider this a proof-of-concept.

The problem

For example, I performed a Google search1 for “Smarties”. Inspecting the first link - to Wikipedia, I see:

Predictions 2021

Predictions for 2021

Humans are notoriously poor at assigning probabilities to events, even those that are highly relevant to their daily lives. This year I’m making a deliberate attempt to calibrate my prediction abilities by correlating predictions with reality. The judgments of truth of these outcomes will be made on December 31, 2021, although some of the outcomes will have been decided substantially in advance of that.

Coronavirus

  1. An effective vaccine will be widely available in Canada: 70%.
  2. I will have received a coronavirus vaccine: 65%
  3. I will have personally contracted coronavirus infection: 20%
  4. Someone in my household will have contracted coronavirus: 20%
  5. Schools in London-Middlesex will close due to coronavirus outbreak: 30%
  6. U.S. deaths from COVID-19 > 300,000: 60%
  7. YAPCA will resume in-person activities before end of term because of lifting coronavirus restrictions: 15%
  8. Violin lessons will resume in-person before the end of term because of lifting coronavirus restrictions: 20%
  9. Daily case counts exceed 30 on any day in 2021 for London-Middlesex: 50%.

Politics

  1. Joe Biden will be elected to the U.S. Presidency: 80%
  2. Donald Trump will officially concede the election if he is defeated: 10%
  3. The U.S. Senate will change to Democratic control: 60%
  4. The U.S. House of Representatives will remain in Democratic control: 99%
  5. Joe Biden will die or become impaired in office: 10%
  6. Florida’s electoral votes go to Biden: 45%
  7. Michigan’s electoral votes go to Biden: 50%
  8. Pennsylvania’s electoral votes go to Biden: 60%
  9. Ohio’s electoral votes go to Biden: 20%
  10. Wisonsin’s electoral votes go to Biden: 40%
  11. Arizona’s electoral votes go to Biden: 55%
  12. Lindsey Graham is defeated: 30%
  13. Mitch McConnell is defeated: 10%
  14. Susan Collins is defeated: 45%
  15. Results of election are known by November 5, 2020: 60%
  16. Donald Trump attends the Inauguration ceremonies: 20%
  17. Boris Johnson is still UK PM: 60%
  18. Justin Trudeau is still Canadian PM: 70%
  19. Queen Elizabeth dies: 20%
  20. Prince Philip dies: 30%
  21. Roe v. Wade is overturned: 10%
  22. Coney-Barrett is confirmed: 100%

Family

  1. [redacted]: 70%
  2. [redacted]: 50%
  3. [redacted]: 60%
  4. [redacted]: 80%
  5. [redacted]: 30%
  6. [redacted]: 50%
  7. We own a third dog: 25%
  8. [redacted]: 20%
  9. [redacted]: 20%
  10. Any member of our immediate family household travels on an airliner: 40%
  11. Audra has a new car: 25%
  12. [redacted]: 20%
  13. Interlochen holds in-person summer camp: 40%

Russian

  1. I complete Anki reviews on 100% of days: 70%
  2. I complete Anki reviews on at least 80% of days: 80%
  3. My tutor-rated speaking ability is improved by at least 25% on a 0-10 scale: 70%
  4. I’ve read at least 6 short stories in Russian: 25%
  5. I do prosody practice on at least 50% of days: 10%

Writing

  1. I write more than 5 articles on Suzuki Experience: 40%
  2. I write more than 12 articles on Ojisanseiuichi.com: 60%

Technology/Economy

  1. I purchase a new laptop: 15%
  2. I purchase a new cell phone: 10%
  3. I set up a VPN for privacy purposes: 65%
  4. I cancel my Facebook account: 20%
  5. I check Facebook less than twice a day on 80% of days: 90%
  6. I resume using Instagram: 20%
  7. I’m using a text editor other than Sublime or Atom: 50%
  8. I unblock Twitter: 10%
  9. DJIA closes above 30,000: 60% 10 I update to new major macOS version: 60%

Personal

  1. I workout on at least 80% of days: 20%
  2. I workout on at least 50% of days: 40%
  3. I workout on at least 25% of days: 50%
  4. I take an SSRI or related medication: 30%
  5. [redacted]: 60%
  6. I sit zazen on at least 80% of days: 10%
  7. I sit zazen on at least 50% of days: 30%
  8. I sit zazen on at least 25% of days: 40%
  9. I write 2021 goals: 95%
  10. I complete all 2021 goals: 10%
  11. I complete more than 50% of 2021 goals: 50%
  12. We begin kitchen renovation: 15%
  13. [redacted]: 60%
  14. I read more than 10 books: 20%
  15. I read more than 5 books: 90%
  16. I read more than 4 novels: 15%
  17. I travel anywhere on an airliner: 10%
  18. I install radio transceiver in back lock: 60%
  19. I install USB charger outlet behind office cabinet: 25%
  20. I can play Rachmaninoff partita transcription from memory: 30%

Extracting ID3 tags from the command line - two methods

As part of a Hazel rule to process downloaded mp3 files, I worked out a couple different methods for extracting the ID3 title tag. Not rocket science, but it took a little time to sort out. Both rely on non-standard third-party tools, both for parsing the text and for extracting the ID3 tags.

Extracting ID3 title with ffprobe

ffprobe is part of the ffmpeg suite of tools which on macOS can be installed with Homebrew. If you don’t have the latter, go install it now; because it opens up so many tools for your use. In this case, it makes ffmpeg available via brew install ffmpeg.

Using variables in Keyboard Maestro scripts

Having fallen in love with Keyboard Maestro for its flexibility in macOS automation, I began experimenting with scripting in various languages, like my old favourite Perl. That’s when the fun began. How do we access KM variables inside a Perl script.

Let’s see what the documentation says:

So the documentation clearly states that this script

#!/usr/bin/perl

print scalar reverse $KMVAR_MyVar;

should work if I have a KM variable named MyVar. But, you guessed it - it does not.

Hugo cache busting

Justification

Although caching can make page loads notably faster, it comes with a cost. Browsers aren’t always capable of taking note when a cached resource has changed. I’ve noticed recently that Safari utterly refuses to reload .css files even after emptying the browser cache and clearing the web history.

Background

With a lot of help from the a pair of articles written by Ukiah Smith, I’ve developed a workflow for dealing with this problem during the deployment process. He describes two approaches to the problem of static asset caching, one an improvement on the other. I’ve implemented something like what he describes using the git file hash to modify the filename of the css files. When the client browser sees a new filename, it always reloads the resource. So the problem is to figure out how to only change the filename when the contents have changed. Let’s say you tweak a css parameter and want to ensure that client browsers load the correct version. We can use the git file hash, and append it on the filename. Then the only remaining problem is to make sure that the page head template knows how to find the correct version to bake into the pages. Here, our approach is the same as Smith’s.