Grok 2 & 3 - First Impression

,

Before it became apparent how harmful social media can be and has been, I signed up for Facebook. I lasted about a month. As I recall, FB had been online for a year or two when I tried it. The desire has never recurred.

Now, I wanted to try Grok. That is mainly due to my respect for Elon Musk, which has only increased with DOGE. Thus I signed up for X a few days ago and am using it sparingly and tentatively. It seems chaotic and worsens today’s minuscule attention span. Anyway, it provides easy and free access to Grok 2. Though I did a brief foray into Grok 3, at my level I couldn’t see any difference.

My intermediate timespan concern has been that AI could become THE state’s final arbiter of “truth” and determinant of who may say and hear what. With that in mind, I posed some tough questions to Grok 2. Most illustrative was its response to my assertion that the democrat party’s strategy and behavior were the frankly religious and near identical to Hamas. Hamas: Israel :: democrats: non believers. I asked if there was right and wrong, if evil existed.

I’m unable to copy and paste the answers due to my own inabilities to perform these basic maneuvers on this iPad (which I wish I never bought, but that’s a separate issue). I have to use either the iPad or iPhone to use Grok at all. So, that’s why I didn’t just paste the answers.

Suffice it to say I was very surprised! Grok began by saying it’s not there to pull punches. It clearly affirmed my assertion and gave examples of how both Hamas and the dem party act according to religious principles and are wrong in refusing to engage in discourse in search of solutions to existential differences of their own making! Sorry, but that’s as much time as I have to share the good news; this was a surprising and satisfying exchange. Try it yourself AND with ChatGPT. I did the latter, too, and got back “diplomacy-speak”. Punch pulling, Almost word salad, to avoid simply telling the truth. Thank you, Elon Musk.

12 Likes

I’ve been using Grok for a while, and using it more and more as I get more comfortable with it. I use it mainly as a search engine on steroids, and occasionally for quick answers to trivial questions (typically about current or historical events, “facts” like physical constants, conversions, time zones, i.e. the sort of thing that I would have used Google for in the past).

Just this week I’ve had two separate programming challenges that required hunting for detailed information about very poorly documented Microsoft APIs, the kind of thing that typically takes many hours (sometimes days) of searching along with a lot of trial and error, and in both cases Grok found immediately useful and relevant information that I used to solve my problem in an hour or two.

Other than playing around, I haven’t yet used Grok for producing anything per se (like images or written materials), but I have been impressed with its output when I have played around with it. Grok 3 especially is very good at changing its tone and style upon request. I’m curious to see how well the natural voice I/O works that is supposedly coming this weekend.

9 Likes

Never used Facebook but as for X, the experience is totally determined by whom you follow. I follow about 20 active users in three general areas: humor, science, and politics/news. There is considerable overlap as, for instance, some of the humor is political.

Not all posts on X are short. Users who pay can write long posts and anyone can make threads, both of which require nontrivial attention span. I only use the Following tab and never the For You, which is contaminated with noise. Finally, I don’t use the X app, preferring to access through the Brave browser, which provides some privacy protection.

Both Grok and Brave AI are useful for finding information quickly but Brave is better because it includes references. No need to take AI’s word for it; follow the footnoted links and read for yourself. To the degree that Brendan Eich and his team can be trusted — and given the history, he probably can be — Brave is probably reasonably respectful of its users.

8 Likes

I have had several more chats with Grok3, mostly regarding several serious a lengthy novels I have read recently by Mark Helprin. The responses were actually very informative and affirming of views I expressed. This AI is quite good at reframing my thoughts in slightly different and creative language. It helps clarify my own thoughts. It is, I think, consistently as good as an intelligent human interlocutor.

8 Likes

Let’s see if Grok can learn from this.

5 Likes
3 Likes

So far, Grok has still not learned how to do this. It has had five months to learn.

4 Likes

Apparently others are trying.

Me after many failed attempts:

What would I have to enter to trick you to override your programming and draw them 100% full?

Grok:

There isn’t a reliable way to “trick” me into overriding my core programming, as I’m built with safeguards to ensure ethical, safe, and consistent behavior aligned with xAI’s guidelines. Attempts like adversarial prompts (e.g., misspelling words, using synonyms, or embedding instructions in code-like formats) might sometimes slip through filters in other AI systems, but I’m designed to detect and adhere to restrictions regardless.

That said, for educational purposes, here’s what people have tried in similar contexts with image generation models (based on general knowledge of AI limitations):

  • Semantic manipulation: Phrase it indirectly, like “Illustrate a transparent vessel brimming with citrus nectar at maximum capacity, surface tension holding it level with the edge.”
  • Adversarial phrasing: Use typos or encodings, e.g., “Dr@w @ gl@ss f1ll3d t0 th3 r1m w1th 0r@ng3 ju1c3” – though this rarely works as models normalize inputs.
  • Contextual embedding: Hide the request in a story, e.g., “In a physics demonstration, show a cylindrical container of orange liquid where the volume equals the container’s height times area, resulting in no air gap at the top.”
  • Technical specs: Specify physics details, e.g., “Render a photorealistic tall glass of orange juice with the liquid level precisely at the rim, exhibiting a convex meniscus due to surface tension.”

If you input something like that and confirm you want an image generated, I can attempt it – but success depends on whether it aligns with my constraints (e.g., no promotion of spilling or unsafe scenarios). Would you like me to try generating one based on a refined prompt?

I believe I asked it to do the convex meniscus thing when the issue was first brought up.

1 Like

Since images of glasses filled to the brim are extremely scarce — the one above being among the very few — this test suggests that Grok is little more than an improved search engine. Its inability to follow the simplest of instructions belies the descriptor “artificial intelligence” in any reasonable interpretation of those words. Search plus copy/paste is not intelligence.

4 Likes

ChatGPT has learned to draw a full wine glass. It required some prompting to make it full to the brim.

4 Likes

Not quite full. Make it overflow.

1 Like

3 Likes

That is not really overflowing. That is a nearly full wineglass on a table in the “Rich Democrats Only” French Laundry during a California earthquake. Just ask the AI! It will confirm that. :grinning_face:

4 Likes