<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="4.4.1">Jekyll</generator><link href="https://www.tspi.at/atom.xml" rel="self" type="application/atom+xml" /><link href="https://www.tspi.at/" rel="alternate" type="text/html" /><updated>2026-04-12T19:18:22+02:00</updated><id>https://www.tspi.at/atom.xml</id><title type="html">tspi.at</title><entry><title type="html">Prediction Error vs Measurement Error in Model Fitting</title><link href="https://www.tspi.at/2026/04/11/fitandpredictionerror.html" rel="alternate" type="text/html" title="Prediction Error vs Measurement Error in Model Fitting" /><published>2026-04-11T00:00:00+02:00</published><updated>2026-04-11T14:33:38+02:00</updated><id>https://www.tspi.at/2026/04/11/fitandpredictionerror</id><content type="html" xml:base="https://www.tspi.at/2026/04/11/fitandpredictionerror.html"><![CDATA[<p>When fitting a model function to experimental data, one is often confronted with a subtle but important conceptual issue: the uncertainty of the fitted model is frequently much smaller than the apparent measurement uncertainty of the individual data points. At first glance, this may appear contradictory. How can a model, fitted to noisy data, exhibit smaller uncertainty than the data itself?</p>

<p>This apparent paradox often leads to misinterpretation. Observers may assume that the narrow confidence bands of the fitted model represent the measurement uncertainty, and consequently judge the data against these bands, leading to incorrect conclusions about data quality or model validity.</p>

<p>This article clarifies the distinction between:</p>

<ul>
  <li><a href="#measurement-error">Measurement error</a></li>
  <li><a href="#model-fit-uncertainty">Model (fit) uncertainty</a></li>
  <li><a href="#residuals-and-data-driven-variance">Residuals and data-driven variance (Measurement error)</a></li>
  <li><a href="#prediction-error">Prediction error</a></li>
</ul>

<p>We will demonstrate how these quantities arise via a <a href="#a-practical-example">simulated measurement</a>, how they should be interpreted, and how they can be computed in practice.</p>

<p>In the end we will provide a <a href="#conclusion-and-the-common-interpretation-pitfall">short summary and conclusion</a>.</p>

<h2 id="measurement-error-vs-model-uncertainty">Measurement Error vs Model Uncertainty</h2>

<h3 id="measurement-error">Measurement Error</h3>

<p>Measurement error describes the uncertainty associated with each observed data point. Formally, we write:</p>

[
\begin{aligned}
y_i &= f(x_i, \theta) + \epsilon_i
\end{aligned}
]

<p>Here</p>

<ul>
  <li>$y_i$ is the measured value</li>
  <li>$f(x_i, \theta)$ is the underlying model at the position $x_i$ and for the parameter set $\theta$</li>
  <li>$\epsilon_i$ is the random error term with a variance $\sigma_y^2$</li>
</ul>

<p>The error $\epsilon_i$ is typically determined by the measurement process itself (measurement noise, environmental fluctuations, discretization, etc.). This corresponds to the standard deviation of the measurement process. $y_i$ is typically the mean obtained from repeated measurements.</p>
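<p>To make this decomposition concrete, the noise model can be simulated in a few lines. A minimal sketch; the linear model and all numbers here are arbitrary illustrations, not taken from the example later in this article:</p>

```python
import numpy as np

rng = np.random.default_rng(42)

def f(x, a, b):
    # Hypothetical model function f(x; theta) with theta = (a, b)
    return a * x + b

x = np.linspace(0.0, 10.0, 50)
sigma_y = 0.5                           # standard deviation of the noise term
y = f(x, 2.0, 1.0) + rng.normal(0.0, sigma_y, size=x.size)

# The empirical scatter around the true model recovers sigma_y
print(np.std(y - f(x, 2.0, 1.0)))
```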

<h3 id="model-fit-uncertainty">Model (Fit) Uncertainty</h3>

<p>When fitting a model $f(x, \theta)$ to data, the parameters $\theta$ are estimated from <em>all</em> observations. The uncertainty of these parameters is given by the covariance matrix:</p>

[
C := \mathrm{Cov}(\theta)
]

<p>The matrix encodes how precisely the parameters are determined by the fitting / regression procedure. The uncertainty of the <em>model prediction</em> at a given point $x$ is obtained by propagating the covariance:</p>

[
\begin{aligned}
\sigma_f^2(x) &= \left(\nabla_\theta f(x, \theta)\right)^T C \nabla_\theta f(x, \theta)
\end{aligned}
]

<p>The quantity $\sigma_f$ represents the <strong>confidence band</strong> of the fitted model. The width of this band decreases with the number of data points (similar to the <a href="/2025/07/18/sesdstable.html">standard error of a measurement</a>). For well-conditioned problems with independent observations, the scaling can be <em>estimated</em> as:</p>

[
\sigma_f \sim \frac{\sigma_y}{\sqrt{N}}
]

<p>Even if individual measurements are noisy, the estimated parameters of the assumed model can be determined very precisely.</p>
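<p>The propagation formula above can be evaluated numerically for any model by approximating the parameter gradient with finite differences. A sketch, with a made-up linear model and covariance matrix for illustration:</p>

```python
import numpy as np

def f(x, theta):
    # Hypothetical model f(x; theta) = theta[0] * x + theta[1]
    return theta[0] * x + theta[1]

def sigma_f(x, theta, C, h=1e-6):
    """Propagate the parameter covariance C to the model prediction at x
    using a finite-difference approximation of the parameter gradient."""
    grad = np.array([
        (f(x, theta + h * np.eye(len(theta))[k]) - f(x, theta)) / h
        for k in range(len(theta))
    ])
    return float(np.sqrt(grad @ C @ grad))

theta = np.array([2.0, 1.0])
C = np.array([[0.01, 0.0],
              [0.0, 0.04]])             # assumed covariance from a fit

# For this linear model the band at x = 5 is sqrt(25 * 0.01 + 0.04)
print(sigma_f(5.0, theta, C))
```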

<blockquote>
  <p><strong>Fit uncertainty</strong> $\sigma_f$: How confident can we be about the fitted model.</p>
</blockquote>

<p>Note that a small $\sigma_f$ <strong>does not imply that the model is correct</strong>. You need to apply proper statistical tests on your hypothesis.</p>

<h2 id="residuals-and-data-driven-variance">Residuals and Data-Driven Variance</h2>

<p>The residuals quantify how well the model describes the observed data:</p>

[
r_i = y_i - \hat{y_i}
]

<p>Here $\hat{y_i} = f(x_i, \hat{\theta})$ is the prediction of the data value by the fitted model. From these residuals one can estimate the variance of the data around the model:</p>

[
\sigma_r^2 = \frac{1}{N-p} \sum_{i=1}^{N} r_i^2
]

<p>Here:</p>

<ul>
  <li>$N$ is the number of observations (measurement points)</li>
  <li>$p$ is the number of fitted parameters</li>
  <li>$N-p$ thus is the degrees of freedom</li>
</ul>

<p>The quantity $\sigma_r$ represents the <strong>intrinsic scatter</strong> of the data and is typically <strong>comparable to the measurement noise</strong> (but they are not equal). In the case of <em>correlated noise</em>, $\sigma_r$ <strong>underestimates</strong> the true uncertainty.</p>
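<p>In code this estimator is a one-liner. A toy example with made-up numbers, chosen only to make the arithmetic visible:</p>

```python
import numpy as np

def residual_sigma(y, y_hat, p):
    """Estimate the scatter of the data around the fitted model using
    N - p degrees of freedom (p = number of fitted parameters)."""
    r = np.asarray(y, dtype=float) - np.asarray(y_hat, dtype=float)
    return float(np.sqrt(np.sum(r**2) / (len(r) - p)))

# Four points, two fitted parameters, a single residual of 1:
# sigma_r = sqrt(1 / (4 - 2)) = sqrt(0.5)
print(residual_sigma([1, 2, 3, 5], [1, 2, 3, 4], p=2))
```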

<blockquote>
  <p><strong>Residual error / intrinsic scatter</strong> $\sigma_r$: How much does the measurement process scatter (typically comparable to the measurement noise, though $\sigma_r$ includes also the model mismatch, unmodeled systematics, etc.)</p>
</blockquote>

<h2 id="prediction-error">Prediction Error</h2>

<p>The prediction error describes where, at a given position $x$, you would expect the next measurement to reside with a given certainty. This must account for two contributions that are typically <em>treated as independent</em>:</p>

<ul>
  <li>Uncertainty of the model parameters (given by $\sigma_f$)</li>
  <li>Scatter of the data around the model (given by $\sigma_r$)</li>
</ul>

<p>This corresponds to the classical distinction between <strong>confidence intervals</strong>, the uncertainty of the fitted mean model, and <strong>prediction intervals</strong>, the uncertainty of individual observations.</p>

<p>Under the <strong>assumption of independence</strong> this yields a total error $\sigma$:</p>

[
\begin{aligned}
\sigma^2(x) &= \sigma_f^2(x) + \sigma_r^2 \\
\sigma(x) &= \sqrt{\sigma_f^2(x) + \sigma_r^2}
\end{aligned}
]

<blockquote>
  <p><strong>Prediction error</strong> $\sigma$: How well can the model predict a new measurement at position $x$</p>
</blockquote>

<p>Keep in mind that the assumption of independence breaks in case of heteroscedastic errors or correlated noise!</p>
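<p>Combining both terms is a one-liner. The illustrative numbers below show the typical situation where the intrinsic scatter dominates a narrow confidence band:</p>

```python
import math

def prediction_sigma(sigma_f_x: float, sigma_r: float) -> float:
    """Total prediction uncertainty under the independence assumption."""
    return math.sqrt(sigma_f_x**2 + sigma_r**2)

# With a narrow confidence band, sigma is almost entirely sigma_r
print(prediction_sigma(0.05, 0.70))
```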

<h2 id="a-practical-example">A Practical Example</h2>

<p>To illustrate the concepts, we simulate a derivative Lorentzian (Cauchy) shaped signal, add noise in both axes, perform a fit and then compute the parameter uncertainties, the model confidence band, the residual variance and the full prediction uncertainty.</p>

<p><img src="/assets/images/png/fiterrors001.png" alt="" /></p>

<p>In this plot one can see:</p>

<p>In this plot one can see, first, the <strong>blue (simulated) datapoints</strong>. The simulation assumes an amplitude of $A=120$, $x_0=400$, $\mathrm{FWHM}=3.0$ (i.e. $\gamma=1.5$), $\sigma_x = 0.3$ and $\sigma_y = 0.7 \cdot \mathrm{max}(y_i)$. On top of these we performed the <strong>orange fit</strong>, a least squares fit using the Levenberg-Marquardt algorithm against the same model function that was used to synthesize the data. This yields $\hat{x_0} = 399.892560 \pm 0.097864\,\mathrm{MHz}$ and $\mathrm{FWHM} = 2.369803 \pm 0.385163\,\mathrm{MHz}$. The narrow <strong>blue region</strong> around the orange fit function is the fit uncertainty $\sigma_f$. As one can see, this band is extremely narrow and does not reflect the scatter of the individual datapoints. If one compared new datapoints against this region alone, one would wrongly conclude that the measurements do not support the model hypothesis. Adding the residual measurement error $\sigma_r$ yields the total error $\sigma = \sqrt{\sigma_f(x)^2 + \sigma_r^2}$, shown as the <strong>orange band</strong>. This band is much wider and contains roughly 68 percent of all datapoints. The <strong>blue errorbar-like line</strong> on top of the points is again $\sigma_r$, the expected scatter of individual measurements.</p>

<p>As one can see, the <strong>confidence band of the model (blue region)</strong> is much narrower than the <strong>prediction region (orange band)</strong> for individual measurements.</p>
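<p>The structure of these bands can be reproduced with a numpy-only sketch. To stay self-contained it uses a quadratic model via <code>np.polyfit</code> as a stand-in for the derivative-Lorentzian fit; all numbers are illustrative:</p>

```python
import numpy as np

rng = np.random.default_rng(1)

# Synthesize noisy data from a known quadratic (stand-in model)
x = np.linspace(-5, 5, 200)
theta_true = (0.5, -1.0, 2.0)
sigma_y = 2.0
y = np.polyval(theta_true, x) + rng.normal(0.0, sigma_y, x.size)

# Least-squares fit returning the parameter covariance matrix C
theta_hat, C = np.polyfit(x, y, 2, cov=True)

# Confidence band: propagate C through the gradient (x^2, x, 1)
G = np.vander(x, 3)                  # rows are gradients d f / d theta
sigma_f = np.sqrt(np.einsum("ij,jk,ik->i", G, C, G))

# Residual scatter with N - p degrees of freedom
r = y - np.polyval(theta_hat, x)
sigma_r = np.sqrt(np.sum(r**2) / (x.size - 3))

# Prediction band: combination of both contributions
sigma_pred = np.sqrt(sigma_f**2 + sigma_r**2)

# The confidence band is far narrower than the data scatter
print(sigma_f.max(), sigma_r)
```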

<h2 id="conclusion-and-the-common-interpretation-pitfall">Conclusion and the Common Interpretation Pitfall</h2>

<p>Comparing measurement data directly to the confidence band $\sigma_f(x)$ instead of the prediction interval $\sigma(x)$ is a common mistake and leads to systematic overestimation of discrepancies between model and data.</p>

<p>The correct interpretation is:</p>

<ul>
  <li>$\sigma_f(x)$: Confidence in the mean fitted model</li>
  <li>$\sigma_r$: Scatter of individual measurements</li>
  <li>$\sigma(x)$: Uncertainty of predicted future observations</li>
</ul>

<p>This implies that for sufficiently large datasets:</p>

[
\sigma_f(x) \ll \sigma_r
]

<p>This leads to the conclusions:</p>

<ul>
  <li>A fitted model can be known much more precisely than individual measurements</li>
  <li>The covariance matrix $\mathrm{Cov}(\theta)$ describes the <em>parameter uncertainty</em>, not measurement noise</li>
  <li>The model (fit) uncertainty $\sigma_f$ describes the confidence of the model. This corresponds to the <em>posterior uncertainty of the model</em>.</li>
  <li>Residuals $\sigma_r$ capture the <em>intrinsic data scatter</em> or the <em>measurement noise</em></li>
  <li>The correct uncertainty for <em>future predictions</em> is the combination of both effects $\sigma(x) = \sqrt{\sigma_f(x)^2 + \sigma_r^2}$. This corresponds to the <em>posterior predictive distribution</em>.</li>
</ul>]]></content><author><name>tsp</name></author><category term="Physics" /><category term="School math" /><category term="Math" /><category term="Basics" /><category term="Tutorial" /><category term="Statistics" /><category term="Measurements" /><summary type="html"><![CDATA[When fitting models to experimental data, a subtle but critical misunderstanding often arises: the uncertainty of a fitted model can be significantly smaller than the apparent measurement error of the data it is based on. This frequently leads to confusion, especially when narrow confidence bands are misinterpreted as representing the scatter of the measurements themselves. In practice, this can result in incorrect judgments about data quality or even the validity of the model. This article clarifies the distinction between measurement error, model uncertainty, residual variance, and prediction error, and explains how these quantities are related but fundamentally different. Using a practical simulated example, it demonstrates why fitted models can be precise and how to correctly interpret uncertainty as well as confidence when comparing models to experimental data or when predicting future observations.]]></summary></entry><entry><title type="html">ModBus in Practice: From RS485 Buses to Secure, Scalable Automation</title><link href="https://www.tspi.at/2026/04/07/modbus.html" rel="alternate" type="text/html" title="ModBus in Practice: From RS485 Buses to Secure, Scalable Automation" /><published>2026-04-07T00:00:00+02:00</published><updated>2026-04-07T00:00:08+02:00</updated><id>https://www.tspi.at/2026/04/07/modbus</id><content type="html" xml:base="https://www.tspi.at/2026/04/07/modbus.html"><![CDATA[<p>Modern automation systems often appear deceptively complex. Fieldbuses, industrial protocols, cloud integrations, and proprietary stacks suggest a level of complexity that is often unnecessary for many real-world applications. 
At the core of many reliable automation systems, however, lies a much simpler idea: a shared communication medium with deterministic request–response semantics.</p>

<p>One of the most enduring implementations of this idea is <a href="https://www.modbus.org/modbus-specifications">ModBus</a>, particularly in its RS485-based RTU variant. Despite its age, ModBus remains widely used in industrial control, laboratory environments, energy systems, and increasingly in small-scale automation such as homes, gardens, and greenhouses.</p>

<p>This article explores ModBus from a practical systems perspective. It focuses on my own RS485-based deployments and the challenges that arise when integrating such systems into modern software environments, and presents a set of tools I designed to bridge the gap between legacy fieldbus systems and contemporary infrastructure: a gateway service, an implementation for Atmel AVR microcontrollers and a software client for Python applications and scripts.</p>

<ul>
  <li><a href="#why-modbus-still-matters">Why ModBus Still Matters</a></li>
  <li><a href="#physical-layer-rs485-in-practice">Physical Layer: RS485 in Practice</a></li>
  <li><a href="#protocol-layer-modbus-rtu">Protocol Layer: ModBus RTU</a></li>
  <li><a href="#the-real-problem-multi-master-access">The Real Problem: Multi-Master Access</a></li>
  <li><a href="#architecture-a-central-modbus-gateway">Architecture: A Central ModBus Gateway</a>
    <ul>
      <li><a href="#my-implementation-modbusgw">My Implementation: modbusgw</a>
        <ul>
          <li><a href="#installation">Installation</a></li>
          <li><a href="#configuration">Configuration</a></li>
          <li><a href="#frontend-configurations">Frontend Configurations</a>
            <ul>
              <li><a href="#virtual-serial-ports-pty">Virtual Serial Ports (pty)</a></li>
              <li><a href="#modbus-ip-socket">ModBus IP TCP Socket</a></li>
            </ul>
          </li>
          <li><a href="#backend-configurations">Backend Configurations</a>
            <ul>
              <li><a href="#hardware-serial-ports">Hardware Serial Ports</a></li>
              <li><a href="#modbus-ip-via-tcp">ModBus IP via TCP</a></li>
            </ul>
          </li>
          <li><a href="#routing-configuration">Routing Configuration</a></li>
          <li><a href="#example-configuration-file">Example configuration file</a></li>
          <li><a href="#launching-and-controlling-the-gateway">Launching and Controlling the Gateway</a></li>
        </ul>
      </li>
    </ul>
  </li>
  <li><a href="#client-library">Client Library</a>
    <ul>
      <li><a href="#a-simple-example">A Simple Example</a></li>
    </ul>
  </li>
  <li><a href="#embedded-side-avr-framework">Embedded Side: AVR Framework</a></li>
  <li><a href="#practical-applications">Practical Applications</a>
    <ul>
      <li><a href="#my-favorite-hardware-devices">My Favorite Hardware Devices</a></li>
    </ul>
  </li>
  <li><a href="#security-considerations">Security Considerations</a></li>
  <li><a href="#design-philosophy">Design Philosophy</a></li>
  <li><a href="#conclusion">Conclusion</a></li>
  <li><a href="#references">References</a></li>
</ul>

<p><img src="/assets/images/png/modbus001.png" alt="" /></p>

<h2 id="why-modbus-still-matters">Why ModBus Still Matters</h2>

<p>What makes ModBus particularly interesting is not its feature set - but the absence of it. Its simplicity leads to <strong>robustness, debuggability and long-term stability</strong>. There are no hidden layers, no opaque negotiation steps, and no dynamic topology discovery. Everything is explicit and implementable by any manufacturer or hobbyist.</p>

<p>This simplicity extends all the way down to the physical and protocol layers, making ModBus exceptionally easy to implement even on very constrained hardware. Unlike <a href="https://de.wikipedia.org/wiki/Controller_Area_Network">CAN</a> or <a href="https://de.wikipedia.org/wiki/Ethernet">Ethernet</a>-based systems - which require more complex controllers, protocol stacks and often significantly more expensive interface hardware - ModBus RTU over RS485 can be realized on sub-Euro microcontrollers with minimal resources and very inexpensive transceiver ICs. This makes it particularly attractive for distributed sensing and control applications where cost, simplicity and reliability are more important than raw throughput.</p>

<p>Compared to more modern systems such as MQTT-based automation or high-performance fieldbuses like <a href="https://de.wikipedia.org/wiki/EtherCAT">EtherCAT</a>, ModBus trades flexibility and throughput for predictability, reliability and ease of implementation. In many environments - especially where timing constraints are moderate and reliability is critical - this tradeoff is highly desirable.</p>

<h3 id="limitations">Limitations</h3>

<p>While ModBus excels through its simplicity, this minimalism also imposes a number of practical limitations that must be considered when designing real-world systems.</p>

<p>One of the most prominent constraints is <em>throughput</em>. Especially in RS485-based RTU deployments, the achievable data
rate is relatively low. At 9600 baud, effective payload throughput stays well below one kilobyte
per second, and even at higher baud rates the request–response nature of the protocol introduces unavoidable
overhead. As the number of devices grows, polling cycles become longer, increasing latency for both control
and monitoring tasks. For most simple monitoring and control tasks that operate on timescales of
seconds to minutes, however, this does not matter.</p>

<p>Closely related to this is the strict master–slave model, which prevents concurrent access to the bus. All
communication must be serialized and initiated by the master, and even read-only operations cannot be
performed in parallel. This becomes increasingly problematic in modern systems where multiple independent
services require access to the same data. Without an additional coordination layer, such as the gateway
architecture presented later in this article, this leads to contention and non-deterministic behavior. In 
addition the requirement of the master initiating the communication prevents fast event notification from
sensors, the time constraint is defined by the polling interval by the master.</p>

<p>Another limitation lies in the lack of higher-level protocol features. ModBus provides no built-in mechanisms
for device discovery, configuration, or semantic description of data. Registers are purely numerical and their
meaning is defined externally, often in device-specific documentation. This makes integration straightforward
for simple systems, but increasingly complex as systems grow and heterogeneous devices are introduced.</p>

<p>From a security perspective, ModBus in its original form offers no authentication, no encryption, and no
integrity protection beyond basic checksums. While this is acceptable in isolated industrial networks,
it becomes a critical issue when systems are connected to larger infrastructures or exposed to untrusted
environments.</p>

<p>Finally, although often described as deterministic, real-world ModBus systems can exhibit variable latency
due to device response times, retries, and bus contention. Determinism exists primarily at the protocol
level, but system-level timing guarantees depend heavily on implementation details and network design.</p>

<p>These limitations do not diminish the value of ModBus - in many cases, they are the direct consequence
of its simplicity. However, they highlight the need for carefully designed system architectures when
integrating ModBus into modern, distributed environments.</p>

<h2 id="physical-layer-rs485-in-practice">Physical Layer: RS485 in Practice</h2>

<p>RS485 provides a differential signaling scheme that allows reliable communication over long distances and in electrically noisy environments. Unlike single-ended signaling, RS485 transmits the difference between two lines, making it highly resilient against common-mode noise. The following image shows a capture of the A line (yellow), the B line (turquoise) and the calculated difference (violet) of an RS485 transmission on a <a href="https://amzn.to/48tOLsA">cheap USB oscilloscope</a>:</p>

<p><img src="/assets/images/jpg/rs485_01.jpg" alt="Example oscilloscope trace showing an RS485 transmission" /></p>

<p>Another important characteristic is its physical reach. At relatively low baud rates such as 9600, cable lengths of up to roughly 1400 meters are achievable on standard twisted-pair copper cabling without requiring fiber optics (note that you are usually using $0.75 \mathrm{mm}^2$ cabling with 4 poles for A, B, ground and DC supply voltage between 5 and 36V). Even at higher data rates like 115200 baud, distances on the order of 400 meters are still realistic within a single segment. This makes RS485 particularly attractive for distributed installations such as gardens, greenhouses, industrial halls, or laboratory environments where devices are spread across medium scale areas.</p>

<p>Typical deployments use a linear bus topology with termination resistors at both ends. Correct termination ($120 \Omega$ resistors) and biasing are essential to avoid reflections and undefined bus states. In practice, many issues attributed to <em>protocol problems</em> are in fact caused by improper physical layer implementation.</p>

<p>The bus is usually operated in <strong>half-duplex mode</strong>, meaning that only one device can transmit at a time. This leads directly to one of the central architectural constraints of ModBus RTU systems: arbitration.</p>

<p>There are, however, practical limits. While the protocol allows addressing up to 255 devices, real-world deployments are usually constrained by electrical loading of the bus. In many cases, a single RS485 segment supports on the order of 32-128 devices, depending on transceiver characteristics, bus loading, termination quality, and topology. Careful design - such as using repeaters or segmenting the bus - may be required for larger installations.</p>

<h2 id="protocol-layer-modbus-rtu">Protocol Layer: ModBus RTU</h2>

<p><a href="https://www.modbus.org/modbus-specifications">ModBus RTU</a> operates on a strict master–slave model. A single master initiates all communication, while slaves only respond to requests addressed to them.</p>

<p>Frames are transmitted within strict timing constraints: mandatory silent intervals (typically 3.5 character times) delimit frames and are used to detect message boundaries. Each frame contains:</p>

<ul>
  <li>Device address</li>
  <li>Function code</li>
  <li>Payload</li>
  <li>CRC checksum</li>
</ul>

<p>Even though timing is part of the protocol and slave implementations must interpret silence periods carefully, the master side is more forgiving: typical implementations using hardware UARTs (for example in USB-to-RS485 adapters) do not impose strict timing constraints and are largely insensitive to operating system scheduling or buffering, which makes them easy to implement in software. Timing requirements are primarily relevant on the slave side, where frame detection depends on correct interpretation of inter-frame gaps.</p>
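<p>The CRC appended to every frame is the standard CRC-16/MODBUS (reflected polynomial 0xA001), simple enough to sketch directly. The example builds a read-holding-registers request; the CRC is transmitted low byte first:</p>

```python
def modbus_crc16(frame: bytes) -> int:
    """CRC-16/MODBUS: init 0xFFFF, reflected polynomial 0xA001."""
    crc = 0xFFFF
    for byte in frame:
        crc ^= byte
        for _ in range(8):
            if crc & 1:
                crc = (crc >> 1) ^ 0xA001
            else:
                crc >>= 1
    return crc

# Read-holding-registers request: slave 1, function 3, start 0, count 2
pdu = bytes([0x01, 0x03, 0x00, 0x00, 0x00, 0x02])
frame = pdu + modbus_crc16(pdu).to_bytes(2, "little")
print(frame.hex())
```

A useful property of this CRC: running it over a complete frame including its own CRC yields zero, which is how receivers verify integrity.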

<h2 id="the-real-problem-multi-master-access">The Real Problem: Multi-Master Access</h2>

<p>ModBus RTU assumes a single master. In practice, modern systems often require multiple independent software components to access the same bus. These software architectures are typically composed of multiple loosely coupled services - as of today often following microservice principles - to improve modularity, scalability and maintainability through separation of concerns. In such environments, it is common that different services require access to the same physical devices for control, monitoring or logging purposes. Techniques such as Command Query Responsibility Segregation (CQRS) further emphasize this separation by distinguishing between control paths (commands that modify system state) and read paths (queries used for monitoring and reporting), which may also operate under different security constraints.</p>

<p>Without coordination, this leads to collisions, corrupted frames and undefined system behavior. Even if collisions are avoided, interleaving requests from multiple sources can break assumptions about timing and state. This mismatch between the original design and modern usage patterns is one of the key challenges when integrating ModBus into contemporary systems.</p>

<p>I resolved this problem for my own deployments by developing <code class="language-plaintext highlighter-rouge">modbusgw</code>, presented in the next section.</p>

<h2 id="architecture-a-central-modbus-gateway">Architecture: A Central ModBus Gateway</h2>

<p>To resolve the multi-master problem, a gateway can be introduced. This gateway acts as the <strong>only physical master</strong> on the given RS485 bus, while exposing multiple logical interfaces to clients. Conceptually, the gateway acts as a serialization layer for bus access while exposing a parallel interface to clients.</p>

<p>The gateway performs:</p>

<ul>
  <li>Arbitration between competing requests</li>
  <li>Scheduling of bus access</li>
  <li>Mapping between different transport layers</li>
  <li>Remapping device IDs between different virtual bus representations and the real physical backends</li>
</ul>

<p>It allows multiple applications to interact with the same physical buses safely and deterministically.</p>

<h3 id="my-implementation-modbusgw">My Implementation: modbusgw</h3>

<p>The presented solution, <code class="language-plaintext highlighter-rouge">modbusgw</code>, implements this gateway architecture in a modular fashion.</p>

<p>Frontends allow clients to access the service:</p>

<ul>
  <li>Virtual serial ports (PTY) provide an interface that looks like a real physical UART-based device</li>
  <li>Unix domain sockets (UDS) for ModBus/TCP clients on the local machine, without exposing the service to the network.</li>
  <li>TCP sockets for remote access via control networks.
    <ul>
      <li>Optional TLS and mutual TLS for secure communication and client authentication for systems exposed to non-isolated networks.</li>
    </ul>
  </li>
</ul>

<p>Backends connect to actual devices:</p>

<ul>
  <li>RS485 buses via USB adapters</li>
  <li>Remote ModBus/TCP devices</li>
</ul>

<p>A routing layer in between allows mapping of device IDs and registers, as well as filtering requests. This enables the creation of <strong>security boundaries</strong> within the system and allows selective exposure of functionality to different sets of clients.</p>

<h4 id="installation">Installation</h4>

<p>The gateway has been implemented in Python and is <a href="https://github.com/tspspi/modbusgw">available on GitHub</a>. It can be installed via its <a href="https://pypi.org/project/modbus-gateway">PyPI package</a>:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>pip install modbus-gateway
</code></pre></div></div>

<h4 id="configuration">Configuration</h4>

<p>The application is configured from a single JSON configuration file. When using the FreeBSD <code class="language-plaintext highlighter-rouge">rc.init</code>
script it resides by default at <code class="language-plaintext highlighter-rouge">/usr/local/etc/modbusgateway.cfg</code>; when executing the program from the command line, the default location
is <code class="language-plaintext highlighter-rouge">~/.config/modbusgateway.cfg</code>. The configuration file location can be overridden via the <code class="language-plaintext highlighter-rouge">--config</code> flag or
the <code class="language-plaintext highlighter-rouge">modbusgw_config</code> option in <code class="language-plaintext highlighter-rouge">/etc/rc.conf</code>.</p>

<p>The configuration is split into different sections:</p>

<ul>
  <li><code class="language-plaintext highlighter-rouge">service</code> provides configuration of the main daemon</li>
  <li><code class="language-plaintext highlighter-rouge">bus</code> configures the internal message bus</li>
  <li><code class="language-plaintext highlighter-rouge">frontends</code> contains a list of frontend configurations over which clients
are capable of accessing the daemon</li>
  <li><code class="language-plaintext highlighter-rouge">backends</code> is the counterpart and defines the interfaces that are accessed
on behalf of the clients via the gateway.</li>
  <li><code class="language-plaintext highlighter-rouge">routes</code> provides a match-list based configuration on how to route messages 
between frontends and backends.</li>
</ul>

<p>The <code class="language-plaintext highlighter-rouge">service</code> section configures the PID file (used to prevent multiple running
instances), the state directory used for log- and tracefiles,
as well as the log level:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>"service" : {
   "log_level" : "INFO",
   "pid_file" : "/var/run/modbusgw.pid",
   "state_dir" : "/var/modbusgw/",
   "reload_grace_seconds" : 5
}
</code></pre></div></div>

<p>The <code class="language-plaintext highlighter-rouge">bus</code> configuration configures the internal buffer for incoming requests
that are routed to various backends:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>"bus" : {
   "request_queue_size" : 64,
   "response_timeout_ms" : 1500
}
</code></pre></div></div>

<p>Note that this timeout should be shorter than the application and frontend
timeouts.</p>

<h4 id="frontend-configurations">Frontend Configurations</h4>

<h5 id="virtual-serial-ports-pty">Virtual Serial Ports (pty)</h5>

<p>Virtual serial ports are directly accessible via <code class="language-plaintext highlighter-rouge">pyserial</code> and similar interfaces.
This allows existing legacy software to access the gateway with unmodified code, by simply
pointing it at the virtual serial port device path:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>{
   "id" : "virtual_serial_rtu",
   "type" : "serial_rtu_socket",
   "socket_path" : "/var/modbusgw/ttyBus0",
   "pty_mode" : "rw",
   "idle_close_seconds" : 600,
   "frame_timeout_ms" : 5.0
}
</code></pre></div></div>

<p>The shown configuration instantiates a virtual serial port at the specified <code class="language-plaintext highlighter-rouge">socket_path</code>,
allowing read-write transactions. The frame timeout handles incomplete messages on the
application side. The name <code class="language-plaintext highlighter-rouge">virtual_serial_rtu</code> is an arbitrarily chosen identifier that is
referenced in the routing configuration.</p>

<h5 id="modbus-ip-tcp-socket">ModBus IP TCP Socket</h5>

<p>A ModBus IP frontend speaks the ModBus/TCP protocol over a TCP socket (optionally
supporting TLS or mTLS for authenticated sessions). The following configuration exposes
unencrypted ModBus/TCP, applying only IP-subnet-based filters:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>{
   "id" : "frontend_tcp",
   "type" : "tcp_modbus_tcp",
   "host" : "192.0.2.1",
   "port" : 1234,
   "cidr_allow" : [
      "127.0.0.0/8",
      "192.0.2.0/24"
   ]
}
</code></pre></div></div>

<p>If TLS is desired the following configuration can be added to the frontend configuration
object:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>   "tls" : {
      "cert_file" : "/path/to/server.crt",
      "key_file" : "/path/to/server.key",
      "ca_file" : "/path/to/rootca.crt",
      "require_client_cert" : true,
      "client_dn_allow" : [
         "CN=ModbusGW Test Client"
      ]
   }
</code></pre></div></div>

<p>The <code class="language-plaintext highlighter-rouge">cert_file</code> and <code class="language-plaintext highlighter-rouge">key_file</code> establish the server identity. The <code class="language-plaintext highlighter-rouge">ca_file</code> is only
used when <code class="language-plaintext highlighter-rouge">require_client_cert</code> is set to <code class="language-plaintext highlighter-rouge">true</code> to enable client authentication. The
additional (optional) <code class="language-plaintext highlighter-rouge">client_dn_allow</code> list restricts which DNs from
valid certificates (after certificate validation) are allowed to access the frontend.</p>
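<p>The shape of such a DN filter can be pictured against the subject structure that Python’s <code class="language-plaintext highlighter-rouge">ssl</code> module reports via <code class="language-plaintext highlighter-rouge">getpeercert()</code>. This is an illustrative sketch of the check, not the gateway’s code:</p>

```python
def subject_cn(peercert):
    """Extract the commonName from an ssl.getpeercert()-style dict."""
    for rdn in peercert.get("subject", ()):
        for name, value in rdn:
            if name == "commonName":
                return value
    return None

def dn_allowed(peercert, client_dn_allow):
    """Check the certificate CN against the configured allow list."""
    cn = subject_cn(peercert)
    return cn is not None and ("CN=" + cn) in client_dn_allow

# Subject tuple shaped like ssl.SSLSocket.getpeercert() returns it
cert = {"subject": ((("commonName", "ModbusGW Test Client"),),)}
print(dn_allowed(cert, ["CN=ModbusGW Test Client"]))  # True
```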

<h4 id="backend-configurations">Backend Configurations</h4>

<h5 id="hardware-serial-ports">Hardware Serial Ports</h5>

<p>The <code class="language-plaintext highlighter-rouge">pyserial</code> backend uses the <a href="https://pypi.org/project/pyserial/">pyserial</a> library
to access a USB-to-RS485 interface. This is the simplest hardware interface
for DIY setups. The specified serial configuration is applied when accessing the backend.
Again, the arbitrary <code class="language-plaintext highlighter-rouge">id</code> is referenced in the routing configuration.</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>{
   "id" : "hardware_serial",
   "type" : "pyserial",
   "device" : "/dev/ttyU0",
   "baudrate" : 9600,
   "parity" : "N",
   "stop_bits" : 1,
   "request_timeout_ms" : 1200
}
</code></pre></div></div>

<h5 id="modbus-ip-via-tcp">ModBus IP via TCP</h5>

<p>A TCP backend can be configured via the <code class="language-plaintext highlighter-rouge">tcp_modbus</code> backend:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>{
   "id" : "tcp_backend",
   "type" : "tcp_modbus",
   "host" : "127.0.0.1",
   "port" : 1234,
   "connect_timeout" : 2.0,
   "pool_size" : 2,
   "use_tls" : true,
   "tls" : {
      "ca_file" : "/path/to/root.crt",
      "cert_file" : "/path/to/client.crt",
      "key_file" : "/path/to/client.key"
   }
}
</code></pre></div></div>

<p>The <code class="language-plaintext highlighter-rouge">use_tls</code> flag and the <code class="language-plaintext highlighter-rouge">tls</code> block are optional and are only used when (m)TLS is
desired. The <code class="language-plaintext highlighter-rouge">root.crt</code> is used for server certificate validation, the client keys for authentication
via mTLS.</p>
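<p>The <code class="language-plaintext highlighter-rouge">pool_size</code> key presumably bounds the number of simultaneously open backend connections (an assumption on my part, based on the key name). The underlying mechanism can be sketched with a blocking pool built on the standard <code class="language-plaintext highlighter-rouge">queue</code> module:</p>

```python
import queue
from contextlib import contextmanager

class ConnectionPool:
    """Minimal blocking pool: at most pool_size connections exist at once."""

    def __init__(self, factory, pool_size=2):
        self._slots = queue.Queue(maxsize=pool_size)
        for _ in range(pool_size):
            self._slots.put(factory())

    @contextmanager
    def connection(self):
        conn = self._slots.get()  # blocks while all connections are in use
        try:
            yield conn
        finally:
            self._slots.put(conn)

# A hypothetical factory standing in for an actual TCP connect:
pool = ConnectionPool(factory=lambda: object(), pool_size=2)
with pool.connection() as conn:
    pass  # a request/response transaction would happen here
```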

<h4 id="routing-configuration">Routing Configuration</h4>

<p>The routing configuration is provided as a list of routing rules that are matched
against incoming requests from the frontends. The first match determines to which backend
a message is routed. The <code class="language-plaintext highlighter-rouge">backend</code> and <code class="language-plaintext highlighter-rouge">mirror_to_mqtt</code> keys are not used
for matching; all other fields apply:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>{
   "frontend" : "virtual_serial_rtu",
   "backend" : "hardware_serial",
   "match" : {
      "unit_ids" : [ "*" ],
      "function_codes" : [ "*" ]
   },
   "mirror_to_mqtt" : [ ]
}
</code></pre></div></div>

<p>The routing <code class="language-plaintext highlighter-rouge">match</code> block allows filtering on device IDs and function codes
as well as operations. For example, to allow only function code 1 (read coils)
for the virtual device <code class="language-plaintext highlighter-rouge">5</code>, redirecting the operation to the backend device ID <code class="language-plaintext highlighter-rouge">1</code>,
one would use:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>{
   "frontend" : "virtual_serial_rtu",
   "backend" : "hardware_serial",
   "match" : {
      "unit_ids" : [ 5 ],
      "function_codes" : [ 1 ],
      "operations" : [ "read" ]
   },
   "unit_override" : 1,
   "mirror_to_mqtt" : [ ]
}
</code></pre></div></div>

<p>Here the <code class="language-plaintext highlighter-rouge">match</code> block specifies conditions that <em>all</em> have to be fulfilled. The
optional <code class="language-plaintext highlighter-rouge">unit_override</code> replaces the device ID seen on the virtual frontend bus
with the given unit number before handing the request off to the backend device.
All fields can be used in arbitrary combinations.</p>
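<p>Putting these rules together - first match wins, <code class="language-plaintext highlighter-rouge">"*"</code> acts as a wildcard and <code class="language-plaintext highlighter-rouge">unit_override</code> rewrites the unit ID - the routing decision can be sketched as follows (illustrative only, not the gateway’s actual code):</p>

```python
def field_matches(allowed, value):
    """A field matches on the wildcard or on an exact entry."""
    return "*" in allowed or value in allowed

def route_request(routes, frontend, unit_id, function_code, operation):
    """Return (backend, effective_unit_id) for the first matching route."""
    for route in routes:
        m = route["match"]
        if (route["frontend"] == frontend
                and field_matches(m.get("unit_ids", ["*"]), unit_id)
                and field_matches(m.get("function_codes", ["*"]), function_code)
                and field_matches(m.get("operations", ["*"]), operation)):
            return route["backend"], route.get("unit_override", unit_id)
    return None  # no route matched: the request is rejected

routes = [{
    "frontend": "virtual_serial_rtu",
    "backend": "hardware_serial",
    "match": {"unit_ids": [5], "function_codes": [1], "operations": ["read"]},
    "unit_override": 1,
}]
print(route_request(routes, "virtual_serial_rtu", 5, 1, "read"))
# ('hardware_serial', 1)
```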

<h4 id="example-configuration-file">Example configuration file</h4>

<p>The following configuration exposes a single serial to RS485 interface
via a local virtual serial port as well as a ModBus IP socket available
via unencrypted TCP:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>{
   "service" : {
      "log_level" : "INFO",
      "pid_file" : "/var/run/modbusgw.pid",
      "state_dir" : "/var/modbusgw/",
      "reload_grace_seconds" : 5
   },
   "bus" : {
      "request_queue_size" : 64,
      "response_timeout_ms" : 1500
   },
   "frontends" : [
      {
         "id" : "virtual_serial_rtu",
         "type" : "serial_rtu_socket",
         "socket_path" : "/var/modbusgw/ttyBus0",
         "pty_mode" : "rw",
         "idle_close_seconds" : 600,
         "frame_timeout_ms" : 5.0
      },
      {
         "id" : "frontend_tcp",
         "type" : "tcp_modbus_tcp",
         "host" : "192.0.2.1",
         "port" : 1234,
         "cidr_allow" : [
            "127.0.0.0/8",
            "192.0.2.0/24"
         ]
      }
   ],
   "backends" : [
      {
         "id" : "hardware_serial",
         "type" : "pyserial",
         "device" : "/dev/ttyU0",
         "baudrate" : 9600,
         "parity" : "N",
         "stop_bits" : 1,
         "request_timeout_ms" : 1200
      }
   ],
   "routes" : [
      {
         "frontend" : "virtual_serial_rtu",
         "backend" : "hardware_serial",
         "match" : {
            "unit_ids" : [ "*" ],
            "function_codes" : [ "*" ]
         },
         "mirror_to_mqtt" : [ ]
      },
      {
         "frontend" : "frontend_tcp",
         "backend" : "hardware_serial",
         "match" : {
            "unit_ids" : [ "*" ],
            "function_codes" : [ "*" ]
         },
         "mirror_to_mqtt" : [ ]
      }
   ]
}
</code></pre></div></div>

<h4 id="launching-and-controlling-the-gateway">Launching and Controlling the Gateway</h4>

<p>The gateway can be executed in foreground mode on the command line:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>$ modbusgw
</code></pre></div></div>

<p>In addition, it supports running as a daemon. To control the daemon, the
command line client supports the usual commands:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>$ modbusgw start
$ modbusgw stop
$ modbusgw status
$ modbusgw restart
$ modbusgw reload
</code></pre></div></div>

<p>For usage on <a href="https://www.freebsd.org">FreeBSD</a>, my operating system of choice, one
can use an <code class="language-plaintext highlighter-rouge">rc.d</code> script stored in <code class="language-plaintext highlighter-rouge">/usr/local/etc/rc.d/modbusgw</code>:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>#!/bin/sh
# PROVIDE: modbusgw
# REQUIRE: LOGIN
# KEYWORD: shutdown

. /etc/rc.subr

name="modbusgw"
rcvar="modbusgw_enable"

load_rc_config $name

: ${modbusgw_enable:="NO"}
: ${modbusgw_command:="/usr/local/bin/modbusgw"}
: ${modbusgw_config:="/usr/local/etc/modbusgateway.cfg"}
: ${modbusgw_user:="modbusgw"}
: ${modbusgw_group:="modbusgw"}
: ${modbusgw_pidfile:="/var/run/modbusgw.pid"}
: ${modbusgw_var_dir:="/var/modbusgw"}
: ${modbusgw_log_file:="${modbusgw_var_dir}/modbusgw.log"}
: ${modbusgw_timeout:="15"}
: ${modbusgw_flags:=""}

command="${modbusgw_command}"
pidfile="${modbusgw_pidfile}"
required_files="${modbusgw_config}"
extra_commands="reload restart status"
start_cmd="${name}_start"
stop_cmd="${name}_stop"
reload_cmd="${name}_reload"
restart_cmd="${name}_restart"
status_cmd="${name}_status"

modbusgw_ensure_var_dir()
{
	if [ ! -d "${modbusgw_var_dir}" ]; then
		install -d -o "${modbusgw_user}" -g "${modbusgw_group}" -m 0750 "${modbusgw_var_dir}"
	else
		chown "${modbusgw_user}:${modbusgw_group}" "${modbusgw_var_dir}"
	fi
}

modbusgw_build_cmd()
{
	_subcmd="$1"
	shift
	_cmd="${command} -c \"${modbusgw_config}\" ${_subcmd}"
	if [ -n "${modbusgw_log_file}" ]; then
		_cmd="${_cmd} --log-file \"${modbusgw_log_file}\""
	fi
	for _arg in "$@"; do
		_cmd="${_cmd} ${_arg}"
	done
	if [ -n "${modbusgw_flags}" ]; then
		_cmd="${_cmd} ${modbusgw_flags}"
	fi
	echo "${_cmd}"
}

modbusgw_run()
{
	_cmd=$(modbusgw_build_cmd "$@")
	if [ "$(id -un)" = "${modbusgw_user}" ]; then
		/bin/sh -c "${_cmd}"
	else
		su -m "${modbusgw_user}" -c "${_cmd}"
	fi
}

modbusgw_start()
{
	modbusgw_ensure_var_dir
	modbusgw_run start
}

modbusgw_stop()
{
	modbusgw_run stop --timeout "${modbusgw_timeout}"
}

modbusgw_reload()
{
	modbusgw_run reload
}

modbusgw_restart()
{
	modbusgw_stop
	sleep 1
	modbusgw_start
}

modbusgw_status()
{
	modbusgw_run status
}

run_rc_command "$1"
</code></pre></div></div>

<p>Configuration then happens as usual for this system via <code class="language-plaintext highlighter-rouge">/etc/rc.conf</code>:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>modbusgw_enable="YES"
modbusgw_config="/usr/local/etc/modbusgateway.cfg"
modbusgw_user="modbusgw"
modbusgw_group="modbusgw"
modbusgw_pidfile="/var/modbusgw/modbusgw.pid"
modbusgw_var_dir="/var/modbusgw"
modbusgw_log_file="/var/modbusgw/modbusgw.log"
</code></pre></div></div>

<p>The gateway is then controlled via the following commands:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>$ /usr/local/etc/rc.d/modbusgw start
$ /usr/local/etc/rc.d/modbusgw stop
$ /usr/local/etc/rc.d/modbusgw status
$ /usr/local/etc/rc.d/modbusgw restart
$ /usr/local/etc/rc.d/modbusgw reload
</code></pre></div></div>

<h2 id="client-library">Client Library</h2>

<p>A corresponding client library, also available in the same <a href="https://github.com/tspspi/modbusgw/">GitHub repository</a> and
installable via a separate <a href="https://pypi.org/project/modbusgw-client/">PyPi package</a> <code class="language-plaintext highlighter-rouge">modbusgw-client</code>, provides a unified
interface across different transports to Python applications and scripts. It supports:</p>

<ul>
  <li>Serial (RTU via pyserial), which can also be used to connect to pty based frontends.</li>
  <li>ModBus/TCP over UDS</li>
  <li>ModBus/TCP over TCP
    <ul>
      <li>Secure variants using TLS and mTLS</li>
    </ul>
  </li>
</ul>

<p>This abstraction allows applications to switch between local and remote deployments without changes to application logic and
without exposure to the actual protocol encoding.</p>

<h3 id="a-simple-example">A simple example</h3>

<p>First, let’s install the package:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>$ pip install modbusgw-client
</code></pre></div></div>

<p>Now one can use the <code class="language-plaintext highlighter-rouge">TcpClient</code> or the <code class="language-plaintext highlighter-rouge">SerialClient</code> classes in a very simple fashion:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>#!/usr/local/bin/python3

from modbusgw_client.tcp_client import TcpClient
from modbusgw_client.serial_client import SerialClient
from modbusgw_client.pdu import WriteSingleCoilRequest, ReadCoilsRequest, ReadHoldingRegistersRequest

# TCP backend

with TcpClient(host="192.0.2.2", port=1234, timeout=10) as client:
   client.execute(WriteSingleCoilRequest(
      unit_id = 2,  # The device ID on the virtual bus
      address = 5,  # "Coil" index
      value = True  # Coil status ("value" keyword name assumed)
   ))

# Serial backend

with SerialClient("/var/modbusgw/ttyBus0", baudrate=9600, timeout=10) as client:
   client.execute(WriteSingleCoilRequest(
      unit_id = 2,  # The device ID on the virtual bus
      address = 5,  # "Coil" index
      value = True  # Coil status ("value" keyword name assumed)
   ))

</code></pre></div></div>

<h2 id="embedded-side-avr-framework">Embedded Side: AVR Framework</h2>

<p>On the device side, I developed a lightweight AVR-based ModBus framework that allows implementation
of custom ModBus slaves on cheap, readily available Atmel ATMega microcontrollers.</p>

<p>The framework is particularly useful for:</p>

<ul>
  <li>Custom sensor readout</li>
  <li>Simple control systems</li>
  <li>Bridging analog signals into ModBus systems</li>
</ul>

<p>One of my current applications is the readout of an <a href="https://www.inficon.com/de/produkte/vakuummessgeraete-und-controller/heiss-ionisationsmessgeraete/pbr-260">Inficon PBR260 Pirani pressure gauge</a> with analog output, making it accessible via ModBus in a vacuum system, as well as the readout of <a href="https://amzn.to/4meSEqZ">ultrasonic water level sensors</a> for water management in a small garden setup.</p>

<p>The framework is available on <a href="https://github.com/tspspi/avrModBus">GitHub</a>. It is built with <code class="language-plaintext highlighter-rouge">avr-gcc</code> and targets
the Atmel <a href="https://amzn.to/48yDirN">ATMega328P</a> and <a href="https://amzn.to/41ho3Q8">ATMega2560</a>. It allows easy interfacing
with the coils, input registers, output registers and holding registers. Examples are provided in
the <a href="https://github.com/tspspi/avrModBus/blob/master/examples/basic/main.c">GitHub repository</a>.</p>

<h2 id="practical-applications">Practical Applications</h2>

<p>In practical deployments, a wide range of ModBus-capable devices can be integrated into a unified system.</p>

<p>Relay modules (e.g. 2, 8, or 32 channel units) can control loads such as pumps, valves, or lighting. Temperature and humidity
sensors provide environmental monitoring. Soil sensors measure moisture and nutrient levels (NPK), enabling
automated irrigation and fertilizing strategies.</p>

<p>Pulse counting modules allow integration of flow sensors, making it possible to monitor water usage as well as
valve operation and failure. Combined with relay-controlled valves, this enables fully automated irrigation systems.</p>
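<p>Converting such pulse counts into a flow rate is simple arithmetic. For illustration, assume a sensor K-factor of 450 pulses per litre - a common value for small hall-effect flow sensors, though the correct factor has to be taken from the sensor’s datasheet:</p>

```python
def flow_l_per_min(pulse_delta, interval_s, pulses_per_litre=450.0):
    """Flow rate from the pulse count difference over one polling interval."""
    litres = pulse_delta / pulses_per_litre
    return litres * 60.0 / interval_s

# 900 pulses in 10 seconds at 450 pulses/litre: 2 litres, i.e. 12 l/min
print(flow_l_per_min(900, 10.0))  # 12.0
```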

<p>In home and lab environments, ModBus is frequently used to monitor HVAC systems, control lights,
control and monitor cooling loops, and monitor power consumption via smart meters. In laboratory setups,
RS485-based systems are commonly used for devices like vacuum pumps and other slow control systems.</p>

<h3 id="my-favorite-hardware-devices">My Favorite Hardware Devices</h3>

<p>My favorite devices are:</p>

<ul>
  <li><a href="https://amzn.to/4sR16PL">Waveshare USB to RS485 interface modules</a></li>
  <li>Waveshare <a href="https://amzn.to/3PTKrwz">32 channel</a> and <a href="https://amzn.to/4sh4WAy">8 channel</a> RTU relay modules, providing control
of typical 230V and low voltage appliances</li>
  <li>Interface modules <a href="https://amzn.to/3Od1Cse">with digital inputs and outputs</a> that I use to interface to <a href="https://amzn.to/4cidMbE">anemometers</a></li>
  <li><a href="https://de.aliexpress.com/item/1005006429949294.html">X0 Pulse Counter</a> modules to interface to <a href="https://amzn.to/4ttOoX0">flow sensors</a></li>
  <li>Chinese soil <a href="https://de.aliexpress.com/item/1005005697940574.html">humidity and NPK sensors</a> providing direct ModBus readout</li>
  <li><a href="https://de.aliexpress.com/item/1005005471608120.html">Indoor SHTC3</a> and outdoor <a href="https://de.aliexpress.com/item/1005004870015772.html">SHT30</a>
temperature and humidity sensors</li>
  <li>GPIO <a href="https://de.aliexpress.com/item/1005003162434730.html">NPN and PNP boards</a></li>
  <li>The <code class="language-plaintext highlighter-rouge">R3DCB08</code> interface board to <a href="https://amzn.to/485D8rZ">DS18B20 onewire temperature sensors</a></li>
</ul>

<h2 id="security-considerations">Security Considerations</h2>

<p>Raw ModBus/TCP has no authentication, no encryption, and no concept of access control. Exposing it directly
to untrusted networks is inherently unsafe. <strong>ModBus/TCP should never be exposed outside of an isolated
automation network or VLAN</strong>.</p>

<p>A gateway-based architecture enables:</p>

<ul>
  <li>Wrapping ModBus communication in TLS, thus providing confidentiality</li>
  <li>Enforcing <strong>client authentication</strong> via mTLS</li>
  <li>Restricting access to specific devices and registers and thus providing
monitoring only access paths, enabling separation of control and monitoring
paths, aligning with CQRS architectures.</li>
</ul>
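<p>Such a monitoring-only access path maps directly onto the routing configuration shown earlier. For example, a route that only admits read operations could look like this (illustrative; function codes 1 through 4 are the standard ModBus read operations):</p>

```json
{
   "frontend" : "frontend_tcp",
   "backend" : "hardware_serial",
   "match" : {
      "unit_ids" : [ "*" ],
      "function_codes" : [ 1, 2, 3, 4 ],
      "operations" : [ "read" ]
   },
   "mirror_to_mqtt" : [ ]
}
```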

<p>Unix domain sockets provide an additional option for local communication with the gateway, avoiding
unnecessary network exposure.</p>

<h2 id="design-philosophy">Design Philosophy</h2>

<p>A key design principle is to keep the physical and protocol layers simple, while moving complexity into
controlled software layers. The RS485 bus remains deterministic, easy to debug and, above all,
cheap and easy to implement. All advanced features - security, multiplexing, abstraction - are implemented
in user space, where they can be maintained, audited, and evolved without affecting system stability.</p>

<p>This separation leads to systems that are both robust and flexible, combining the reliability of industrial
fieldbuses with the capabilities of modern software architectures.</p>

<h2 id="conclusion">Conclusion</h2>

<p>ModBus, and in particular its RS485-based RTU variant, demonstrates that simplicity is not a limitation
but a design strength. Its minimalism allows it to remain understandable, debuggable and implementable across
a wide range of systems - from industrial installations to small-scale home and laboratory setups.</p>

<p>At the same time, modern software architectures impose requirements that the original protocol was never
designed to address. Multiple independent services, distributed systems, and stricter security expectations
fundamentally conflict with the single-master assumption and lack of built-in protection mechanisms.</p>

<p>By introducing a central gateway layer, these two worlds can be reconciled. The physical bus remains
simple, deterministic and reliable, while higher-level concerns such as arbitration, access control,
transport abstraction and security are handled in user space. This separation allows systems to scale
without sacrificing the robustness of the underlying fieldbus.</p>

<p>In practice, this approach enables a wide range of applications - from garden irrigation and environmental
monitoring to laboratory instrumentation and energy management - using inexpensive hardware and straightforward
implementations.</p>

<p>Rather than replacing ModBus with more complex alternatives, it is often more effective to embrace its
simplicity and complement it with well-designed software layers. This combination provides a powerful
foundation for building reliable, secure, and maintainable automation systems.</p>

<h2 id="references">References</h2>

<ul>
  <li>My own software:
    <ul>
      <li>GitHub repositories:
        <ul>
          <li>The <a href="https://github.com/tspspi/modbusgw/">gateway and client implementation</a></li>
          <li><a href="https://github.com/tspspi/avrModBus">AVR ModBus client firmware</a></li>
        </ul>
      </li>
      <li>PyPi packages:
        <ul>
          <li>The <a href="https://pypi.org/project/modbus-gateway">modbus-gateway</a> gateway implementation</li>
          <li>The accompanying <a href="https://pypi.org/project/modbusgw-client">modbusgw-client</a> Python client implementation</li>
        </ul>
      </li>
    </ul>
  </li>
  <li>Hardware:
    <ul>
      <li><a href="https://amzn.to/4vkvX9k">MAX485 transceivers</a> for custom components built around Atmel <a href="https://amzn.to/48yDirN">ATMega328P</a>
and <a href="https://amzn.to/41ho3Q8">ATMega2560</a> boards.</li>
      <li><a href="https://amzn.to/4ctpQrx">4 pole cabling</a></li>
      <li><a href="https://amzn.to/4sR16PL">Waveshare USB to RS485 interface modules</a></li>
      <li>Waveshare <a href="https://amzn.to/3PTKrwz">32 channel</a> and <a href="https://amzn.to/4sh4WAy">8 channel</a> RTU relay modules, providing control
of typical 230V and low voltage appliances</li>
      <li>Interface modules <a href="https://amzn.to/3Od1Cse">with digital inputs and outputs</a> that I use to interface to <a href="https://amzn.to/4cidMbE">anemometers</a></li>
      <li><a href="https://de.aliexpress.com/item/1005006429949294.html">X0 Pulse Counter</a> modules to interface to <a href="https://amzn.to/4ttOoX0">flow sensors</a></li>
      <li>Chinese soil <a href="https://de.aliexpress.com/item/1005005697940574.html">humidity and NPK sensors</a> providing direct ModBus readout</li>
      <li><a href="https://de.aliexpress.com/item/1005005471608120.html">Indoor SHTC3</a> and outdoor <a href="https://de.aliexpress.com/item/1005004870015772.html">SHT30</a>
temperature and humidity sensors</li>
      <li>GPIO <a href="https://de.aliexpress.com/item/1005003162434730.html">NPN and PNP boards</a></li>
      <li>The <code class="language-plaintext highlighter-rouge">R3DCB08</code> interface board to <a href="https://amzn.to/485D8rZ">DS18B20 onewire temperature sensors</a></li>
    </ul>
  </li>
  <li>The <a href="https://www.modbus.org/modbus-specifications">ModBus specification</a></li>
  <li>The <a href="https://amzn.to/48tOLsA">Hantek 6022BE USB oscilloscope</a> for debugging purposes</li>
</ul>]]></content><author><name>tsp</name></author><category term="Programming" /><category term="Electronics" /><category term="Hardware" /><category term="DIY" /><category term="RS485" /><category term="Microcontroller" /><category term="Home automation" /><category term="Automation" /><category term="ModBus" /><summary type="html"><![CDATA[ModBus RTU over RS485 remains one of the simplest and most reliable ways to build distributed automation systems - yet integrating it into modern software architectures is anything but straightforward. While the protocol itself is minimal and easy to implement even on low-cost microcontrollers, its strict single-master model clashes with todays multi-service environments and increasing security requirements. This article explores how to bridge that gap using a gateway-based architecture, utilizing a gateway developed by myself, that enables safe multi-client access, transport abstraction, and secure communication via TLS and mTLS. Along the way, it covers practical RS485 deployment considerations, real-world hardware setups for home, garden, and lab automation, and a lightweight AVR-based framework for building custom ModBus devices.]]></summary></entry><entry><title type="html">Bringing XPPen Tablets to FreeBSD: Reverse Engineering a USB Protocol</title><link href="https://www.tspi.at/2026/04/06/xppenfreebsd.html" rel="alternate" type="text/html" title="Bringing XPPen Tablets to FreeBSD: Reverse Engineering a USB Protocol" /><published>2026-04-06T00:00:00+02:00</published><updated>2026-04-06T18:01:10+02:00</updated><id>https://www.tspi.at/2026/04/06/xppenfreebsd</id><content type="html" xml:base="https://www.tspi.at/2026/04/06/xppenfreebsd.html"><![CDATA[<p>There is a certain kind of frustration that only appears when perfectly functional hardware refuses to cooperate with your operating system of choice. In my case, this happened with an <a href="https://amzn.to/41hn56p">XPPen graphics tablet</a>.</p>

<p><a href="https://www.storexppen.de/">XPPen tablets</a> are, in many respects, surprisingly capable devices. The passive models in particular offer excellent value: precise pen input, good pressure sensitivity, and a solid overall build quality - at a fraction of the cost typically associated with professional drawing tablets. While the <a href="https://amzn.to/4smGVbB">highly professional display-equipped variants</a> can become quite expensive (in my personal opinion totally worth the price), the simpler models are almost irresistible for hobby and development setups.</p>

<p>Out of the box, the tablet was detected at the USB and HID level but none of the usual tooling - neither the base system utilities nor common open-source drivers - managed to produce usable input events. The root cause quickly became apparent: the device did not behave like a standard HID tablet. Instead, it required a vendor-specific initialization sequence and produced non-standard event packets that needed translation before they could be consumed by applications such as GIMP.</p>

<p>At that point, there were two options: abandon the device or solve the problem. Hating unresolved problems and having zero acceptance for unsupported hardware, I chose the second route.</p>

<blockquote class="disclaimer">
  <p>⚠️ <strong>TL;DR</strong>: This application is in no way associated with the official manufacturer. It is available on <a href="https://github.com/tspspi/xppenfbsd">GitHub</a> and installable from <a href="https://pypi.org/project/xppenfbsd/">PyPi</a> via <code class="language-plaintext highlighter-rouge">pip install xppenfbsd</code>. It solves my problem with the Deco Mini7.</p>
</blockquote>

<ul>
  <li><a href="#why-wacom-works-and-xppen-does-not">Why Wacom Works (and XPPen does not)</a></li>
  <li><a href="#reverse-engineering-the-protocol">Reverse Engineering the Protocol</a></li>
  <li><a href="#from-usb-packets-to-usable-input">From USB Packets to Usable Input</a></li>
  <li><a href="#integrating-with-x11">Integrating with X11</a></li>
  <li><a href="#design-considerations">Design Considerations</a></li>
  <li><a href="#limitations-and-future-work">Limitations and Future Work</a></li>
  <li><a href="#conclusion">Conclusion</a></li>
  <li><a href="#references">References</a></li>
</ul>

<p><img src="/assets/images/png/xppen001.png" alt="" /></p>

<h2 id="why-wacom-works-and-xppen-does-not">Why Wacom Works (and XPPen does not)</h2>

<p>To understand why this problem exists at all, it is useful to look at how Wacom devices are typically supported.</p>

<p>Most Wacom tablets implement (or at least closely emulate) pseudo-standard HID interfaces. On Unix-like systems, these devices are handled by the generic input stack and then enhanced by specialized drivers such as the X11 <a href="https://github.com/linuxwacom/xf86-input-wacom">xf86-input-wacom</a> driver. These drivers understand additional semantics like pressure, tilt, tool types, and button mappings—but crucially, they operate on top of a well-defined input abstraction.</p>

<p>In other words: Wacom devices speak a language the operating system and the X11 framework already understand.</p>

<p>XPPen devices, in contrast, often rely on vendor-specific extensions of the HID protocol, requiring initialization sequences and custom report parsing on top of USB’s HID device class. While they expose HID endpoints, they typically <em>require an explicit activation or configuration sequence</em> before they start producing any data. Even then, the reported data does not directly match standard input expectations and must be interpreted and transformed.</p>

<p>This difference explains why Wacom devices tend to work out of the box, while XPPen devices appear <em>dead</em> without proprietary applications.</p>

<h2 id="reverse-engineering-the-protocol">Reverse Engineering the Protocol</h2>

<p>The approach I chose was pragmatic: observe a working system and replicate its behavior.</p>

<p>A Windows test machine was set up with the <a href="https://www.xp-pen.com/download/deco-mini7-v2.html">official XPPen application</a> performing the translation. Using <a href="https://github.com/desowin/usbpcap">USBPcap</a>, I recorded the raw USB traffic generated while interacting with the tablet. This included device initialization, pen movement, pressure changes, and button events.</p>

<p>The resulting capture was then analyzed using <a href="https://www.wireshark.org/">Wireshark</a>. By filtering out unrelated traffic and focusing on the relevant USB endpoints, it became possible to isolate the sequences responsible for device activation and continuous stylus data streaming.</p>

<p>To accelerate the decoding process, I first filtered the packets from the various phases (activation, data delivery, etc.) and used <a href="https://chatgpt.com/codex/">OpenAI’s Codex</a> to assist in identifying structural patterns and generating candidate parsers. While the initial suggestions required manual correction and validation, this significantly reduced the time required to move from raw captures to a working understanding of the protocol.</p>

<h2 id="from-usb-packets-to-usable-input">From USB Packets to Usable Input</h2>

<p>Once the protocol was sufficiently understood, the next step was to reproduce the behavior on <a href="https://www.freebsd.org">FreeBSD</a>. Using <a href="https://github.com/pyusb/pyusb">libusb via a Python interface</a>, I <a href="https://github.com/tspspi/xppenfbsd">implemented a userspace daemon</a> that performs three essential tasks:</p>

<p>First, it detects the tablet and performs the required initialization sequence. Without this step, the device remains silent.</p>

<p>Second, it continuously reads raw data packets from the stylus endpoint. These packets encode position, pressure, tilt, and button states in a vendor-specific format.</p>

<p>Third, it translates these packets into standard input events.</p>

<p>This translation layer is the core of the system. Instead of attempting to modify the kernel or introduce a custom driver, the daemon creates a virtual input device and re-injects events into the system again via an emulated HID device (<code class="language-plaintext highlighter-rouge">uinput</code> via the <code class="language-plaintext highlighter-rouge">evdev</code> compatibility layer). From the perspective of the operating system, this virtual device behaves like a normal stylus.</p>

<p>This design has several advantages. It keeps the implementation entirely in userspace and allows rapid iteration when refining the protocol understanding. It also makes the solution portable across systems that provide similar input injection mechanisms.</p>
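<p>To make the decoding step concrete, the following sketch shows the general shape of such a translation - with an entirely made-up packet layout chosen for illustration; the actual XPPen report format is the one implemented in the repository:</p>

```python
import struct
from typing import NamedTuple

class PenState(NamedTuple):
    x: int
    y: int
    pressure: int
    buttons: int

def decode_packet(raw: bytes) -> PenState:
    """Decode a hypothetical 8-byte report: report ID (1 byte), then
    x, y and pressure as little-endian u16, then a button bitmask."""
    _report_id, x, y, pressure, buttons = struct.unpack("<BHHHB", raw)
    return PenState(x, y, pressure, buttons)

pkt = bytes([0x02, 0x34, 0x12, 0x78, 0x56, 0xE8, 0x03, 0x01])
print(decode_packet(pkt))  # PenState(x=4660, y=22136, pressure=1000, buttons=1)
```

<p>In the real daemon, each decoded state is then mapped onto the corresponding events of the virtual <code class="language-plaintext highlighter-rouge">evdev</code> device.</p>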

<h2 id="integrating-with-x11">Integrating with X11</h2>

<p>Once the virtual device is available, it can be consumed by the existing X11 driver stack.</p>

<p>A minimal configuration binds the generated event device to the stylus driver, allowing applications to interpret the input correctly. The exact device node depends on the system’s input enumeration and should ideally be discovered dynamically, but even a static configuration is sufficient for initial setups.</p>

<p>In my case the input device always appeared as <code class="language-plaintext highlighter-rouge">event7</code> leading to the following configuration placed in <code class="language-plaintext highlighter-rouge">/usr/local/etc/X11/xorg.conf.d/11-wacom.conf</code>:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>Section "InputDevice"
       Identifier "XP-PEN"
       Driver "wacom"
       Option "Device" "/dev/input/event7"
       Option "Type" "stylus"
       Option "USB" "on"
EndSection

Section "ServerLayout"
        Identifier "Default Layout"
        InputDevice "XP-PEN" "SendCoreEvent"
EndSection
</code></pre></div></div>

<p>At this point, the entire pipeline is complete: vendor-specific USB HID protocol &gt; userspace translation &gt; standard input events &gt; X11 driver &gt; application.</p>

<p>Applications such as <a href="https://www.gimp.org/">GIMP</a> can use the tablet without modification, including pressure sensitivity and stylus buttons.</p>

<h2 id="design-considerations">Design Considerations</h2>

<p>One might consider implementing this functionality as a kernel driver. However, this would not be a natural fit for the problem.</p>

<p>The primary task here is not hardware access but protocol translation. The kernel already provides mechanisms for exposing input devices and injecting events, and duplicating this logic in kernel space would introduce unnecessary complexity.</p>

<p>More importantly, from a design perspective, the kernel should remain as small and minimal as possible due to its privileged nature. Moving complex parsing and translation logic into the kernel increases the risk surface and makes debugging significantly harder. By keeping this layer in userspace, failures are contained, iteration is faster, and the system remains more robust overall.</p>

<p>Separating this functionality into a dedicated userspace process is therefore the correct architectural choice - not only from a modularity standpoint but also from a security perspective. It allows the protocol handling to evolve independently while relying on the kernel only for what it does best: providing stable and well-defined interfaces.</p>

<p>Additionally, the solution integrates naturally with system facilities such as device event frameworks, allowing automatic discovery and activation when the tablet is connected.</p>

<h2 id="limitations-and-future-work">Limitations and Future Work</h2>

<p>The current implementation focuses on a single device and a specific model family. Extending it to support multiple tablets or additional models would require further protocol analysis.</p>

<p>There is also room for improvement in device discovery and configuration, particularly in dynamically resolving input device paths and integrating more tightly with desktop environments as well as optimizing the runtime (adding daemonization support, etc.).</p>

<p>Nevertheless, the core functionality is complete: the tablet behaves like a native input device and can be used in real-world applications without noticeable limitations.</p>

<h2 id="conclusion">Conclusion</h2>

<p>What started as a simple compatibility issue turned into a small reverse engineering project spanning USB protocols, driver behavior, and input subsystems. Interestingly, the entire process - from capturing USB traffic to having a fully working implementation - took essentially one sleepless night. This alone highlights how little effort would actually be required for a manufacturer to provide proper cross-platform support by simply releasing documentation (the manufacturers themselves could not feasibly cover all available operating systems due to QA, support, device generations, etc., but could rely on the community).</p>

<p>The key insight is that many unsupported devices are not inherently incompatible - they are simply <em>undocumented</em>. With the right tools and a structured approach, it is often possible to bridge that gap. However, it also raises the question why this gap exists in the first place.</p>

<p>In this case, the combination of USB traffic capture, protocol reconstruction, and userspace event translation resulted in a fully usable graphics tablet on FreeBSD without relying on proprietary drivers. The fact that this can be achieved so quickly strongly suggests that <em>publicly available protocol documentation</em> would make such integrations almost trivial.</p>

<p>It is therefore somewhat difficult to understand why manufacturers so often hesitate to <em>document</em> their hardware interfaces. Instead of enabling broad compatibility across platforms, effort is invested into maintaining proprietary applications for a limited set of operating systems (no vendor can sanely cover the whole ecosystem of operating systems on its own). From a technical standpoint, <em>publishing protocol specifications</em> would significantly reduce duplicated effort, empower the community, and improve the longevity and usability of the hardware.</p>

<p>For anyone encountering similar issues, this workflow provides a practical path forward: observe, decode, translate, and integrate.</p>

<h2 id="references">References</h2>

<p>My implementation is available on <a href="https://github.com/tspspi/xppenfbsd">GitHub</a> and directly installable via <a href="https://pypi.org/project/xppenfbsd/">PyPI</a>.</p>

<ul>
  <li>The passive <a href="https://amzn.to/41hn56p">XPPen graphics tablet</a> as well as its <a href="https://amzn.to/4smGVbB">professional counterpart</a></li>
  <li>The <a href="https://www.storexppen.de/">manufacturer</a> as well as the <a href="https://www.xp-pen.com/download/deco-mini7-v2.html">official XPPen application</a></li>
  <li><a href="https://www.wireshark.org/">Wireshark</a> and <a href="https://github.com/desowin/usbpcap">USBPcap</a></li>
  <li><a href="https://github.com/pyusb/pyusb">pyusb</a> as <code class="language-plaintext highlighter-rouge">libusb</code> frontend useable from Python</li>
  <li>The traditional <a href="https://github.com/linuxwacom/xf86-input-wacom">xf86-input-wacom</a> driver for Wacom and Wacom-compatible tablets</li>
  <li><a href="https://chatgpt.com/codex/">OpenAI’s Codex</a></li>
</ul>]]></content><author><name>tsp</name></author><category term="Programming" /><category term="System administration" /><category term="Hardware" /><category term="Python" /><category term="Vibe coding" /><category term="FreeBSD" /><category term="X11" /><category term="Reverse engineering" /><summary type="html"><![CDATA[When a graphics tablet works flawlessly on one system but appears completely lifeless on another, the problem is rarely the hardware itself. This article explores how an XPPen tablet - perfectly functional and excellent hardware, yet unusable on FreeBSD - was brought to life through a pragmatic reverse engineering approach. By capturing USB traffic, reconstructing the device’s initialization sequence and translating vendor-specific data into standard input events, a fully working userspace solution emerged. Rather than relying on proprietary drivers or kernel modifications, the implementation demonstrates how clean architecture and a structured workflow can bridge compatibility gaps. Along the way, it highlights not only the mechanics of USB protocol analysis and input subsystem integration, but also a broader question: why so many capable devices remain artificially limited by a lack of documentation - when making them work can, in some cases, be surprisingly straightforward.]]></summary></entry><entry><title type="html">Programmatic 3D Model Generation with the Tripo3D API</title><link href="https://www.tspi.at/2026/04/06/tripo3dapi.html" rel="alternate" type="text/html" title="Programmatic 3D Model Generation with the Tripo3D API" /><published>2026-04-06T00:00:00+02:00</published><updated>2026-04-06T16:54:26+02:00</updated><id>https://www.tspi.at/2026/04/06/tripo3dapi</id><content type="html" xml:base="https://www.tspi.at/2026/04/06/tripo3dapi.html"><![CDATA[<p>In recent months, a number of services have emerged that allow generating 3D assets from either text prompts or reference images.
While the web interfaces of these platforms are often polished and interactive, they are typically optimized for manual workflows and subscription-based usage. For engineering pipelines, reproducibility, and automation, however, what we really want is API access.</p>

<p>Beyond the purely technical perspective, this is also a rather fascinating shift: these systems dramatically lower the barrier for creating artistic 3D content. They do not replace skilled artists - and realistically they will not in the foreseeable future - but they act as powerful <em>amplifiers</em>. For experienced artists they can accelerate iteration and ideation, while at the same time enabling people without strong artistic 3D skills to finally materialize their ideas, worlds, and characters.</p>

<p>Things get even more interesting when combined with modern image generation systems (like <a href="https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0">Stable Diffusion</a> and similar approaches). With reasonably consistent prompt engineering, one can first generate coherent visual concepts and then lift them into 3D space. This opens the door to semi-automatically building consistent 3D scenes, asset libraries, or even entire worlds that share a unified style.</p>

<p>In my own workflow I primarily use these systems as a complement to traditional CAD. Most of my classical modeling work is engineering-focused (mechanical parts, devices, tooling), where parametric CAD (mostly in <a href="https://www.freecad.org/">FreeCAD</a>) is still the right tool; tools like the one presented here are not capable of proper technical design at this stage of development. However, for non-technical, artistic, or decorative objects - especially for 3D printing - these generative approaches are extremely valuable. They allow me to produce shapes and aesthetics that I would otherwise struggle to model manually.</p>

<div style="text-align: center">
    <img src="/assets/images/png/tripo3d_sample2_image.png" style="width:28.6%" alt="The input image generated via SD" />
    <img src="/assets/images/png/tripo3d_sample2_slicer.png" style="width:27.1%" alt="Robot mesh in slicer" />
    <img src="/assets/images/png/tripo3d_sample2_print.png" style="width:24.3%" alt="3D printed robot as a single color object" />
</div>

<p>In this article I will walk through a Python-based pipeline, the whole script is <a href="#the-complete-script">provided at the end of the article</a>, that uses the <a href="https://platform.tripo3d.ai/">Tripo3D API</a> to generate, process, and export 3D models in a fully automated fashion. The focus is not just on <em>getting a model</em>, but on building a structured pipeline that produces pseudo-deterministic outputs, metadata, and multiple export formats suitable for further processing (e.g., CNC, 3D printing, simulation, or game engines).</p>

<ul>
  <li><a href="#why-a-pipeline-instead-of-the-web-ui">Why a Pipeline Instead of the Web UI?</a></li>
  <li><a href="#high-level-pipeline-overview">High-Level Pipeline Overview</a></li>
  <li><a href="#core-design-ideas">Core Design Ideas</a>
    <ul>
      <li><a href="#treat-everything-as-a-task">Treat Everything as a Task</a></li>
      <li><a href="#metadata-as-first-class-output">Metadata as First-Class Output</a></li>
      <li><a href="#deterministic-file-naming">Deterministic File Naming</a></li>
    </ul>
  </li>
  <li><a href="#base-model-generation">Base Model Generation</a></li>
  <li><a href="#discovering-parts">Discovering Parts</a></li>
  <li><a href="#texturing-strategies">Texturing Strategies</a>
    <ul>
      <li><a href="#whole-model-texturing">Whole-Model Texturing</a></li>
      <li><a href="#per-part-texturing">Per-Part Texturing</a></li>
    </ul>
  </li>
  <li><a href="#export-system">Export System</a>
    <ul>
      <li><a href="#unified-export-function">Unified Export Function</a></li>
      <li><a href="#supported-formats">Supported Formats</a></li>
      <li><a href="#per-part-export">Per-Part Export</a></li>
    </ul>
  </li>
  <li><a href="#optional-rigging">Optional Rigging</a></li>
  <li><a href="#cli-interface">CLI Interface</a></li>
  <li><a href="#practical-observations">Practical Observations</a></li>
  <li><a href="#moving-from-models-to-reality">Moving from Models to Reality</a></li>
  <li><a href="#outlook">Outlook</a></li>
  <li><a href="#conclusion">Conclusion</a></li>
  <li><a href="#references">References</a>
    <ul>
      <li><a href="#useful-tools">Useful Tools</a></li>
    </ul>
  </li>
  <li><a href="#the-complete-script">The Complete Script</a></li>
</ul>

<h2 id="why-a-pipeline-instead-of-the-web-ui">Why a Pipeline Instead of the Web UI?</h2>

<p>The <a href="https://www.tripo3d.ai/">web frontend</a> is excellent for exploration, iteration, and interactive refinement, as known from typical artistic workflows. In technical environments, however, it has a few limitations: the lack of reproducible batch processing, limited control over export formats and intermediate steps, missing structured metadata capture, and poor integration into existing toolchains.</p>

<p>The script presented here addresses these issues by treating every operation as a task with persistent metadata, storing all intermediate results, supporting both <em>text-to-model</em> and <em>image-to-model</em> workflows, enabling per-part processing and exports, and enforcing deterministic naming and file organization. The key gain of this pipeline approach is <em>automation</em> and therefore <em>scalability</em> - there is no meaningful scaling without automation. Once the process is expressed as a pipeline, generating tens, hundreds, or thousands of assets becomes a straightforward extension rather than a manual effort. Additionally, using the API directly typically means usage-based billing instead of subscription-based time periods, which aligns much better with batch workloads and sporadic large-scale generation runs (a single high-quality model without texturing and rigging takes on the order of ten minutes).</p>

<h2 id="high-level-pipeline-overview">High-Level Pipeline Overview</h2>

<p>The pipeline is structured into several stages:</p>

<ol>
  <li><strong>Base model generation</strong> (text or image input)</li>
  <li><strong>Optional full-model texturing</strong></li>
  <li><strong>Optional per-part texturing</strong></li>
  <li><strong>Optional rigging</strong></li>
  <li><strong>Full model export (STL / 3MF / etc.)</strong></li>
  <li><strong>Per-part export</strong></li>
</ol>

<p>Each stage is implemented as an asynchronous task and stored together with its metadata.</p>

<p>Conceptually, the pipeline looks like this (violet marks your data, green the mandatory step, and yellow the optional steps):</p>

<p><img src="/assets/images/png/tripo_api_steps_001.png" alt="Just a  graphical representation of the steps mentioned above" /></p>

<h2 id="core-design-ideas">Core Design Ideas</h2>

<h3 id="treat-everything-as-a-task">Treat Everything as a Task</h3>

<p>The Tripo API internally works with tasks. Instead of hiding this abstraction, the script embraces it.</p>

<p>Every step:</p>

<ul>
  <li>Returns a <code class="language-plaintext highlighter-rouge">task_id</code></li>
  <li>Is polled until completion</li>
  <li>Is serialized into a JSON metadata file</li>
</ul>

<p>This is implemented via:</p>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">async</span> <span class="k">def</span> <span class="nf">wait_success</span><span class="p">(</span><span class="n">client</span><span class="p">:</span> <span class="n">TripoClient</span><span class="p">,</span> <span class="n">task_id</span><span class="p">:</span> <span class="nb">str</span><span class="p">,</span> <span class="n">label</span><span class="p">:</span> <span class="nb">str</span><span class="p">):</span>
    <span class="n">task</span> <span class="o">=</span> <span class="k">await</span> <span class="n">client</span><span class="p">.</span><span class="nf">wait_for_task</span><span class="p">(</span><span class="n">task_id</span><span class="p">,</span> <span class="n">polling_interval</span><span class="o">=</span><span class="mf">2.0</span><span class="p">,</span> <span class="n">timeout</span><span class="o">=</span><span class="bp">None</span><span class="p">,</span> <span class="n">verbose</span><span class="o">=</span><span class="bp">True</span><span class="p">)</span>
    <span class="n">status</span> <span class="o">=</span> <span class="nf">str</span><span class="p">(</span><span class="nf">getattr</span><span class="p">(</span><span class="n">task</span><span class="p">,</span> <span class="sh">"</span><span class="s">status</span><span class="sh">"</span><span class="p">,</span> <span class="sh">""</span><span class="p">)).</span><span class="nf">lower</span><span class="p">()</span>
    <span class="k">if</span> <span class="sh">"</span><span class="s">success</span><span class="sh">"</span> <span class="ow">not</span> <span class="ow">in</span> <span class="n">status</span><span class="p">:</span>
        <span class="k">raise</span> <span class="nc">RuntimeError</span><span class="p">(...)</span>
    <span class="k">return</span> <span class="n">task</span>
</code></pre></div></div>

<p>This pattern ensures robust error handling, since failures are detected explicitly and surfaced immediately, while also providing full traceability because every step is captured as a task with associated metadata. At the same time, it enables straightforward debugging and replay, as individual steps can be inspected, reproduced, or rerun without having to reconstruct the entire pipeline.</p>

<h3 id="metadata-as-first-class-output">Metadata as First-Class Output</h3>

<p>Instead of only saving meshes, the pipeline stores <em>everything</em> about each task:</p>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">async</span> <span class="k">def</span> <span class="nf">save_task_metadata</span><span class="p">(</span><span class="n">task</span><span class="p">:</span> <span class="n">Any</span><span class="p">,</span> <span class="n">out_dir</span><span class="p">:</span> <span class="n">Path</span><span class="p">,</span> <span class="n">stem</span><span class="p">:</span> <span class="nb">str</span><span class="p">)</span> <span class="o">-&gt;</span> <span class="n">Path</span><span class="p">:</span>
    <span class="n">meta</span> <span class="o">=</span> <span class="nf">task_to_dict</span><span class="p">(</span><span class="n">task</span><span class="p">)</span>
    <span class="n">path</span> <span class="o">=</span> <span class="n">out_dir</span> <span class="o">/</span> <span class="sa">f</span><span class="sh">"</span><span class="si">{</span><span class="n">stem</span><span class="si">}</span><span class="s">.task.json</span><span class="sh">"</span>
    <span class="n">path</span><span class="p">.</span><span class="nf">write_text</span><span class="p">(</span><span class="n">json</span><span class="p">.</span><span class="nf">dumps</span><span class="p">(</span><span class="n">meta</span><span class="p">,</span> <span class="n">indent</span><span class="o">=</span><span class="mi">2</span><span class="p">,</span> <span class="n">ensure_ascii</span><span class="o">=</span><span class="bp">False</span><span class="p">))</span>
    <span class="k">return</span> <span class="n">path</span>
</code></pre></div></div>

<p>This is extremely useful when comparing different parameter settings, debugging failed generations, and building higher-level automation on top of the pipeline. In addition, the stored metadata makes it possible to resume operation with a model at any intermediate step.</p>
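<p>The <code class="language-plaintext highlighter-rouge">task_to_dict</code> helper used above is part of the complete script at the end of the article; conceptually it is a defensive serializer. A minimal sketch of such a helper (not necessarily identical to the real implementation) could look like this:</p>

```python
import json
from typing import Any

def task_to_dict(task: Any) -> Any:
    """Recursively convert an SDK task object into JSON-serializable data.

    Sketch only: primitives pass through, containers are walked, arbitrary
    objects are flattened via their __dict__, and anything else falls back
    to str() so serialization never fails.
    """
    if task is None or isinstance(task, (str, int, float, bool)):
        return task
    if isinstance(task, (list, tuple)):
        return [task_to_dict(v) for v in task]
    if isinstance(task, dict):
        return {str(k): task_to_dict(v) for k, v in task.items()}
    if hasattr(task, "__dict__"):
        return {k: task_to_dict(v) for k, v in vars(task).items()
                if not k.startswith("_")}
    return str(task)
```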

<h3 id="deterministic-file-naming">Deterministic File Naming</h3>

<p>Generated assets are renamed into a consistent scheme:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>01_base.glb
02_textured_full.glb
03_textured_part__001__wheel.glb
06_export_full_stl.stl
08_export_part_stl__003__handle.stl
</code></pre></div></div>

<p>This is handled via:</p>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">dst</span> <span class="o">=</span> <span class="n">out_dir</span> <span class="o">/</span> <span class="sa">f</span><span class="sh">"</span><span class="si">{</span><span class="n">stem</span><span class="si">}</span><span class="s">.</span><span class="si">{</span><span class="n">model_type</span><span class="si">}{</span><span class="n">src</span><span class="p">.</span><span class="n">suffix</span><span class="si">}</span><span class="sh">"</span>
</code></pre></div></div>

<p>Together with <code class="language-plaintext highlighter-rouge">sanitize_name()</code> this guarantees filesystem-safe naming.</p>
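<p><code class="language-plaintext highlighter-rouge">sanitize_name()</code> itself is included in the complete script; a minimal version of such a helper might look like this:</p>

```python
import re

def sanitize_name(name: str, max_length: int = 64) -> str:
    """Map an arbitrary part name to a filesystem-safe token (sketch only).

    Collapses any run of characters outside [A-Za-z0-9._-] into a single
    underscore, trims leading/trailing separators, and bounds the length.
    """
    cleaned = re.sub(r"[^A-Za-z0-9._-]+", "_", name.strip())
    cleaned = cleaned.strip("._-")
    return cleaned[:max_length] or "unnamed"
```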

<h2 id="base-model-generation">Base Model Generation</h2>

<p>The entry point is either:</p>

<ul>
  <li><code class="language-plaintext highlighter-rouge">text_to_model()</code></li>
  <li><code class="language-plaintext highlighter-rouge">image_to_model()</code></li>
</ul>

<p>Example:</p>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">base_task_id</span> <span class="o">=</span> <span class="k">await</span> <span class="n">client</span><span class="p">.</span><span class="nf">text_to_model</span><span class="p">(</span>
    <span class="n">prompt</span><span class="o">=</span><span class="n">args</span><span class="p">.</span><span class="n">prompt</span><span class="p">,</span>
    <span class="n">texture</span><span class="o">=</span><span class="nf">bool</span><span class="p">(</span><span class="n">args</span><span class="p">.</span><span class="n">texture</span><span class="p">),</span>
    <span class="n">face_limit</span><span class="o">=</span><span class="n">args</span><span class="p">.</span><span class="n">face_limit</span><span class="p">,</span>
    <span class="n">generate_parts</span><span class="o">=</span><span class="n">args</span><span class="p">.</span><span class="n">generate_parts</span><span class="p">,</span>
<span class="p">)</span>
</code></pre></div></div>

<p>Important parameters:</p>

<ul>
  <li><code class="language-plaintext highlighter-rouge">texture</code>: Generate textures directly if set to <code class="language-plaintext highlighter-rouge">true</code>.</li>
  <li><code class="language-plaintext highlighter-rouge">face_limit</code>: Control mesh complexity by supplying the absolute face limit. Note that cost typically scales with this limit.</li>
  <li><code class="language-plaintext highlighter-rouge">generate_parts</code>: Ask the model to segment the object.</li>
  <li><code class="language-plaintext highlighter-rouge">smart_low_poly</code>: Useful for real-time applications - first generate a low-polygon representation and later expand to a high polygon count.</li>
</ul>

<div style="text-align: center">
    <img src="/assets/images/png/tripo3d_sample1_image.png" style="width:43.1%" />
    <img src="/assets/images/png/tripo3d_sample1_slicer.png" style="width:36.9%" />
</div>

<h2 id="discovering-parts">Discovering Parts</h2>

<p>One particularly interesting feature is automatic part discovery. Conceptually, you can think of the generative model not only producing a single mesh, but internally reasoning about the object as a composition of semantic substructures—wheels, handles, bodies, limbs, or decorative elements—very much like how a human would describe or sketch it. While the API does not expose this internal representation directly, traces of it appear in the task output, where parts may be listed explicitly or implicitly. By probing these structures defensively, the pipeline reconstructs a usable set of part identifiers.</p>

<p>This is powerful because it turns a monolithic generated mesh into something closer to a structured assembly. Once parts are identifiable, they can be processed independently: textured differently, exported separately, simplified or refined with different parameters, or even replaced downstream. In practical terms, this enables workflows that resemble classical CAD assemblies or game asset pipelines, but starting from a generative model rather than manual modeling.</p>

<p>What makes this particularly compelling is that it bridges a gap between purely artistic generation and engineering-style decomposition. Instead of treating the generated object as a static artifact, it becomes a manipulable system. For example, you can generate a complex object once, then iterate only on a specific component (e.g., retexturing just the “handle” or exporting only the “base” for printing). This selective control is where generative models begin to feel less like black boxes and more like cooperative tools that expose structure—imperfectly, but often sufficiently—to be integrated into real workflows.</p>

<p>Since the SDK does not clearly document where part names are stored, my script uses a defensive extraction strategy:</p>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">def</span> <span class="nf">discover_part_names</span><span class="p">(</span><span class="n">task</span><span class="p">:</span> <span class="n">Any</span><span class="p">)</span> <span class="o">-&gt;</span> <span class="n">List</span><span class="p">[</span><span class="nb">str</span><span class="p">]:</span>
    <span class="n">candidates</span> <span class="o">=</span> <span class="p">[</span><span class="n">task</span><span class="p">,</span> <span class="nf">getattr</span><span class="p">(</span><span class="n">task</span><span class="p">,</span> <span class="sh">"</span><span class="s">output</span><span class="sh">"</span><span class="p">,</span> <span class="bp">None</span><span class="p">)]</span>
    <span class="bp">...</span>
</code></pre></div></div>

<p>This scans:</p>

<ul>
  <li>Raw task object</li>
  <li>Task output</li>
  <li>Serialized dictionary representations</li>
</ul>

<p>The result is a list of part names such as:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>["body", "wheel", "handle", "base"]
</code></pre></div></div>

<p>Note that this approach may break at any time: it worked while I wrote the script, but it relies on undocumented behaviour.</p>
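<p>In spirit, the defensive scan walks every candidate structure and collects anything that looks like a part name. A simplified, self-contained illustration of the idea (not the exact logic of the script):</p>

```python
from typing import Any, List

def discover_part_names(task: Any) -> List[str]:
    """Collect plausible part names from a task object (simplified sketch)."""
    names: List[str] = []

    def visit(node: Any) -> None:
        if isinstance(node, dict):
            for key, value in node.items():
                # Keys like "parts" / "part_names" tend to hold the list.
                if key in ("parts", "part_names") and isinstance(value, list):
                    for item in value:
                        if isinstance(item, str):
                            names.append(item)
                        elif isinstance(item, dict) and isinstance(item.get("name"), str):
                            names.append(item["name"])
                else:
                    visit(value)
        elif isinstance(node, list):
            for item in node:
                visit(item)
        elif hasattr(node, "__dict__"):
            visit(vars(node))

    visit(task)
    # Deduplicate while preserving discovery order.
    return list(dict.fromkeys(names))
```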

<h2 id="texturing-strategies">Texturing Strategies</h2>

<p>Before looking at the concrete approaches, it is useful to briefly clarify what “texturing” actually means in this context. A generated 3D model typically consists of geometry (vertices, edges, faces) that define the shape, and separate surface information that defines how it looks. Texturing is the process of assigning image-based or procedurally generated information onto the surface of that geometry via UV mappings, effectively telling the renderer or downstream tool what color, roughness, metallic properties, and fine visual details each point on the surface should have. Without textures, most models look like uniform gray meshes; with textures, they become visually rich objects with materials such as wood, metal, fabric, or painted surfaces. In many pipelines this also includes PBR (physically based rendering) parameters, which control how light interacts with the surface. For purely functional workflows such as single-color 3D printing, however, textures are typically not required at all - the geometry alone is sufficient, and formats like STL intentionally ignore any surface appearance information.</p>

<p>There are two approaches implemented:</p>

<h3 id="whole-model-texturing">Whole-Model Texturing</h3>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">texture_task_id</span> <span class="o">=</span> <span class="k">await</span> <span class="n">client</span><span class="p">.</span><span class="nf">texture_model</span><span class="p">(</span>
    <span class="n">original_model_task_id</span><span class="o">=</span><span class="n">full_mesh_task_id</span><span class="p">,</span>
    <span class="n">texture</span><span class="o">=</span><span class="bp">True</span><span class="p">,</span>
    <span class="n">pbr</span><span class="o">=</span><span class="bp">True</span><span class="p">,</span>
    <span class="n">text_prompt</span><span class="o">=</span><span class="n">args</span><span class="p">.</span><span class="n">texture_prompt</span><span class="p">,</span>
<span class="p">)</span>
</code></pre></div></div>

<p>This produces a single coherent material.</p>

<h3 id="per-part-texturing">Per-Part Texturing</h3>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">part_tex_task_id</span> <span class="o">=</span> <span class="k">await</span> <span class="n">client</span><span class="p">.</span><span class="nf">texture_model</span><span class="p">(</span>
    <span class="n">part_names</span><span class="o">=</span><span class="p">[</span><span class="n">part_name</span><span class="p">],</span>
    <span class="bp">...</span>
<span class="p">)</span>
</code></pre></div></div>

<p>This enables:</p>

<ul>
  <li>Different materials per component</li>
  <li>Fine-grained control for asset pipelines</li>
</ul>

<h2 id="export-system">Export System</h2>

<p>The export stage is surprisingly powerful and still very simple.</p>

<h3 id="unified-export-function">Unified Export Function</h3>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">async</span> <span class="k">def</span> <span class="nf">export_one_format</span><span class="p">(...):</span>
    <span class="n">export_task_id</span> <span class="o">=</span> <span class="k">await</span> <span class="n">client</span><span class="p">.</span><span class="nf">convert_model</span><span class="p">(</span>
        <span class="nb">format</span><span class="o">=</span><span class="n">fmt</span><span class="p">,</span>
        <span class="n">flatten_bottom</span><span class="o">=</span><span class="n">flatten_bottom</span><span class="p">,</span>
        <span class="n">pivot_to_center_bottom</span><span class="o">=</span><span class="n">pivot_to_center_bottom</span><span class="p">,</span>
        <span class="n">pack_uv</span><span class="o">=</span><span class="n">pack_uv</span><span class="p">,</span>
    <span class="p">)</span>
</code></pre></div></div>

<p>The <code class="language-plaintext highlighter-rouge">flatten_bottom</code> option modifies the geometry such that the lowest region of the model is projected onto a plane, effectively creating a flat base. This is particularly useful for 3D printing because many printers require stable contact with the build plate. Without a flat surface, models may require support structures, which increase print time, material usage, and post-processing effort. By flattening the bottom, the model can often be printed directly, improving adhesion and reliability.</p>

<p>The <code class="language-plaintext highlighter-rouge">pivot_to_center_bottom</code> parameter adjusts the coordinate system of the model such that its origin is moved to the center of the base. This is not just a convenience for slicers, but fundamentally changes how the model is positioned and manipulated in downstream tools. With this pivot, rotations occur around a physically meaningful point (the contact surface), and placement into scenes or assemblies becomes more predictable. For printing workflows, this often means the object appears correctly aligned on the build plate without additional transformations.</p>
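<p>The service performs these transformations server-side, so the exact algorithms are unknown; the following standalone sketch merely illustrates what the two geometric options conceptually do to a vertex list:</p>

```python
from typing import List, Tuple

Vertex = Tuple[float, float, float]

def flatten_bottom(vertices: List[Vertex], tolerance: float = 0.5) -> List[Vertex]:
    """Snap every vertex within `tolerance` of the lowest Z onto that plane."""
    z_min = min(v[2] for v in vertices)
    return [(x, y, z_min if z - z_min <= tolerance else z)
            for x, y, z in vertices]

def pivot_to_center_bottom(vertices: List[Vertex]) -> List[Vertex]:
    """Translate so the XY center of the bounding box sits at the origin
    and the lowest point rests on Z = 0."""
    xs, ys, zs = zip(*vertices)
    cx = (min(xs) + max(xs)) / 2.0
    cy = (min(ys) + max(ys)) / 2.0
    z_min = min(zs)
    return [(x - cx, y - cy, z - z_min) for x, y, z in vertices]
```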

<p>The <code class="language-plaintext highlighter-rouge">pack_uv</code> parameter operates on the texture coordinate space rather than geometry. It reorganizes the UV layout to make more efficient use of available texture space. This reduces wasted texture area, improves resolution of surface details, and is especially relevant when exporting to formats used in rendering or game engines where texture memory and quality are important.</p>

<h4 id="supported-formats">Supported Formats</h4>

<p><code class="language-plaintext highlighter-rouge">STL</code> is the most basic and widely supported format for 3D printing. It encodes only the surface geometry as a triangle mesh and intentionally contains no information about colors, materials, or textures. This simplicity makes it robust and universally compatible, but also limits it to purely geometric workflows. The major advantage is that <code class="language-plaintext highlighter-rouge">STL</code> is <a href="https://github.com/tspspi/libstlio">extremely simple to implement</a> in comparison to the other alternatives.</p>

<p><code class="language-plaintext highlighter-rouge">3MF</code> can be seen as a modern replacement for <code class="language-plaintext highlighter-rouge">STL</code>. It supports not only geometry but also metadata such as colors, materials, multiple objects in a single file, and even printer-specific settings. For advanced printing workflows, especially with multi-material or color printers, <code class="language-plaintext highlighter-rouge">3MF</code> is often the better choice.</p>

<p><code class="language-plaintext highlighter-rouge">GLTF</code> and <code class="language-plaintext highlighter-rouge">FBX</code> are formats primarily used in rendering, simulation, and game engines. They support hierarchical scene structures, materials, textures, animations, and sometimes skeletal rigs. <code class="language-plaintext highlighter-rouge">GLTF</code> is designed as a modern, efficient, and open standard (often described as the <em>JPEG of 3D</em>), while <code class="language-plaintext highlighter-rouge">FBX</code> is older, widely supported, and deeply integrated into many commercial tools.</p>

<p><code class="language-plaintext highlighter-rouge">USDZ</code> is a format designed for augmented reality ecosystems, particularly in environments like mobile devices. It supports compact packaging of geometry, materials, and animations in a way that is optimized for real-time rendering and distribution, making it suitable for AR previews or interactive product visualization.</p>

<h3 id="per-part-export">Per-Part Export</h3>

<p>For manufacturing workflows, splitting models is often essential.</p>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">async</span> <span class="k">def</span> <span class="nf">export_parts_for_format</span><span class="p">(...):</span>
    <span class="k">for</span> <span class="n">idx</span><span class="p">,</span> <span class="n">part_name</span> <span class="ow">in</span> <span class="nf">enumerate</span><span class="p">(</span><span class="n">part_names</span><span class="p">,</span> <span class="n">start</span><span class="o">=</span><span class="mi">1</span><span class="p">):</span>
        <span class="n">stem</span> <span class="o">=</span> <span class="sa">f</span><span class="sh">"</span><span class="si">{</span><span class="n">base_stem</span><span class="si">}</span><span class="s">__</span><span class="si">{</span><span class="n">idx</span><span class="si">:</span><span class="mi">03</span><span class="n">d</span><span class="si">}</span><span class="s">__</span><span class="si">{</span><span class="nf">sanitize_name</span><span class="p">(</span><span class="n">part_name</span><span class="p">)</span><span class="si">}</span><span class="sh">"</span>
</code></pre></div></div>

<p>This results in:</p>

<ul>
  <li>Individually printable components</li>
  <li>Clean naming</li>
  <li>Structured manifests</li>
</ul>
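
<p>The <code class="language-plaintext highlighter-rouge">sanitize_name</code> helper referenced in the snippet above is not shown; the following is only a minimal sketch of what such a function might look like (a hypothetical implementation, not the script's actual code):</p>

```python
import re

def sanitize_name(name: str) -> str:
    # Hypothetical helper: reduce an arbitrary part name to a
    # filesystem-safe stem by lowercasing, collapsing runs of
    # non-alphanumeric characters into single underscores and
    # trimming leading/trailing underscores.
    return re.sub(r"[^a-z0-9]+", "_", name.lower()).strip("_") or "part"

print(sanitize_name("Left Arm (v2)"))  # left_arm_v2
```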

<h2 id="optional-rigging">Optional Rigging</h2>

<p>The pipeline also supports automatic rigging. In the context of 3D graphics, <em>rigging</em> refers to adding an internal skeleton (a hierarchy of bones or joints) to a static mesh, together with weights that define how each part of the surface deforms when those bones move. You can think of it as turning a rigid statue into something that can be posed or animated: bending an arm, rotating a head, or walking becomes possible because the mesh is now bound to this underlying structure. In practice, rigging also defines constraints, joint limits, and sometimes control handles that make animation easier to author. For many readers coming from CAD or printing workflows this concept may be unfamiliar, since purely geometric models are typically static; however, for game engines, simulation, or character animation, rigging is the essential step that converts geometry into something dynamic and controllable.</p>

<div class="language-python highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="n">rig_task_id</span> <span class="o">=</span> <span class="k">await</span> <span class="n">client</span><span class="p">.</span><span class="nf">rig_model</span><span class="p">(</span>
    <span class="n">rig_type</span><span class="o">=</span><span class="nf">normalize_rig_type</span><span class="p">(</span><span class="n">args</span><span class="p">.</span><span class="n">rig_type</span><span class="p">),</span>
    <span class="n">spec</span><span class="o">=</span><span class="nf">normalize_rig_spec</span><span class="p">(</span><span class="n">args</span><span class="p">.</span><span class="n">rig_spec</span><span class="p">),</span>
<span class="p">)</span>
</code></pre></div></div>

<p>Supported rig types correspond to different anatomical or kinematic archetypes. A <strong>biped</strong> rig assumes two legs and typically two arms arranged around a vertical spine; it is the standard for humanoid characters and benefits from well-established conventions (for example Mixamo-style skeletons), making animation retargeting straightforward. A <strong>quadruped</strong> rig is optimized for four-legged locomotion with a horizontal spine and coordinated gait cycles; it better captures weight distribution and natural motion for animals like dogs or horses, but requires different animation clips and controllers than bipeds. An <strong>avian</strong> rig introduces wings and often tail articulation, with joints arranged to support flapping, folding, and gliding; it is useful for birds or winged creatures but can be more complex due to additional degrees of freedom and coupled motions. A <strong>serpentine</strong> rig represents elongated bodies composed of many segments; instead of discrete limbs, motion is produced by propagating waves along the body, which is ideal for snakes or tentacle-like structures but requires spline- or chain-based control schemes.</p>

<p>Each choice encodes <em>assumptions</em> about joint hierarchy, constraints, and typical motion patterns. The advantage is that the resulting skeleton is immediately compatible with common animation tools and libraries for that class, enabling reuse of existing animation data (retargeting) and predictable behavior in physics or IK solvers. The downside is that mismatching the rig type to the geometry can produce unnatural deformation or require additional cleanup.</p>

<p>In animation and game pipelines this is extremely valuable because it converts a static mesh into a controllable asset that can be posed, animated, and simulated in real time. Engines rely on skeletal animation for efficiency (skinning on the GPU), blending between clips (like idle, walk, run), inverse kinematics for interactions (feet on ground, hands on objects), and physics-driven secondary motion. With a suitable rig, the same model can be reused across scenes and behaviors, integrated into state machines, and driven by gameplay logic, turning a generated object into a fully interactive entity rather than a fixed piece of geometry, with minimal or no additional manual work.</p>

<h2 id="cli-interface">CLI Interface</h2>

<p>My script exposes the described features through a command line interface, making it easy to integrate into other tools.</p>

<p>Example usage:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>./tripo.py \
  --mode text \
  --prompt "small steampunk robot with tracks" \
  --texture \
  --generate-parts \
  --export-stl \
  --out ./output
</code></pre></div></div>
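
<p>Internally, such a CLI can be wired up with <code class="language-plaintext highlighter-rouge">argparse</code>. The following is only a sketch covering the flags shown in the usage example above, not the script's full option surface:</p>

```python
import argparse

def build_parser() -> argparse.ArgumentParser:
    # Sketch of the CLI surface from the usage example; the real script
    # exposes more options (rig type, additional export formats, ...).
    parser = argparse.ArgumentParser(description="Tripo3D generation pipeline")
    parser.add_argument("--mode", choices=["text", "image"], required=True)
    parser.add_argument("--prompt", help="text prompt for --mode text")
    parser.add_argument("--texture", action="store_true")
    parser.add_argument("--generate-parts", action="store_true")
    parser.add_argument("--export-stl", action="store_true")
    parser.add_argument("--out", default="./output")
    return parser

args = build_parser().parse_args(
    ["--mode", "text", "--prompt", "small steampunk robot", "--texture"]
)
```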

<h2 id="practical-observations">Practical Observations</h2>

<p>A few interesting takeaways from using this setup:</p>

<ul>
  <li>The API becomes particularly powerful in the context of automated workflows. While the web UI remains the superior tool for artists - who can leverage it to iteratively refine and craft highly detailed results - the API excels when building fully automated pipelines, especially for users without artistic modeling skills. In these scenarios it enables reproducible generation, large-scale asset creation, and efficient scaling. Additionally, for many hobbyist use cases it can even be more cost-effective, as billing is typically based on actual usage rather than fixed subscription periods.</li>
  <li>In practice, the results are often surprisingly close to what one would intuitively expect from the prompt, frequently requiring little to no additional refinement.</li>
  <li>Part extraction is extremely useful but not always perfect. It generates consistent, matching parts that remain separated into individual objects.</li>
  <li>Texture alignment strongly influences visual quality, and in practice generating a consistent texture on the full model first and only then splitting it into parts tends to yield significantly more coherent results than designing or texturing each part individually.</li>
  <li>Export parameters (especially pivot and flattening) matter a lot for downstream workflows</li>
</ul>

<p>Most importantly: once this pipeline exists, it becomes trivial to generate hundreds of assets in a reproducible way.</p>

<h2 id="moving-from-models-to-reality">Moving from Models to Reality</h2>

<p>If you want to move from a virtual 3D object to a physical one with minimal effort, use the <a href="https://github.com/ultimaker/cura">Cura slicer</a> together with a simple 3D printer such as the <a href="https://amzn.to/4dwS1qB">Creality Ender3 V3 SE</a>, the newer <a href="https://amzn.to/4tvoZMQ">Creality Ender3 V3 KE</a> featuring a ceramic heater, or a more costly but capable multi-filament printer like the <a href="https://amzn.to/4sPUXmV">Creality K2 Plus</a> for multi-color prints. Most models created for decorative purposes are directly printable without further modification. If you do need to modify a model, use either the slicer's limited editing capabilities or move on to <a href="https://www.blender.org/">Blender</a>, an amazing tool with a steep but rewarding learning curve, before entering the CAD/CAM pipeline.</p>

<h2 id="outlook">Outlook</h2>

<p>There are several obvious extensions:</p>

<ul>
  <li>Integration into CAD/CAM pipelines (like automatic toolpath generation)</li>
  <li>Coupling with simulation environments</li>
  <li>Closed-loop optimization (generate, evaluate, refine patterns)</li>
  <li>Integration into larger automated pipelines (for example built in <code class="language-plaintext highlighter-rouge">n8n</code>), combining idea and description generation, consistent image generation via diffusion models, 3D asset creation, assembly into animated scenes, and optionally fully automated video production workflows</li>
</ul>

<p>From a systems perspective, this is where things get interesting: the moment generative models become just another pseudo-deterministic component in an engineering pipeline.</p>

<h2 id="conclusion">Conclusion</h2>

<p>Using the <a href="https://platform.tripo3d.ai/">Tripo3D API</a> directly allows transforming 3D generation from an interactive tool into a programmable system component. By structuring the workflow into tasks, capturing metadata, and enforcing pseudo-deterministic outputs, the script provides a solid foundation for integrating generative 3D models into real engineering processes. What makes this particularly compelling is not just the ability to generate individual assets, but to embed generation into larger systems: once automated, the process scales naturally, turning what would otherwise be manual, creative effort into a reproducible and extensible pipeline.</p>

<p>At the same time, this does not replace traditional modeling or artistic workflows, but complements them. Artists will continue to achieve higher-quality and more refined results using interactive tools, while API-driven approaches enable entirely different use cases: batch generation, rapid prototyping, dataset creation, and integration into simulation, robotics, or manufacturing pipelines. In this sense, generative 3D becomes less of a standalone tool and more of a building block that can be composed with other systems.</p>

<p>For practical applications—whether generating printable objects, populating virtual environments, or building automated content pipelines—the combination of automation, usage-based cost models, and flexible export capabilities makes this approach particularly attractive. It allows engineers and technically inclined users to access domains that previously required significant artistic effort, while still leaving room for manual refinement where needed.</p>

<p>If you are already working with CNC, simulation, robotics, or content pipelines, this is where the real shift begins: the moment 3D asset generation becomes just another programmable step in a larger system, rather than a separate, manual process.</p>

<div style="text-align: center">
    <img src="/assets/images/png/tripo3d_sample3_image.png" style="width:25.3%" alt="The image model generated via SD" />
    <img src="/assets/images/png/tripo3d_sample3_slicer.png" style="width:28.9%" alt="The mesh of the dragon curtain holder" />
    <img src="/assets/images/png/tripo3d_sample3_print.png" style="width:25.8%" alt="Printed and spray painted curtain holder" />
</div>

<h2 id="references">References</h2>

<ul>
  <li>The <a href="https://www.tripo3d.ai/">Tripo3D frontend</a></li>
  <li>The <a href="https://platform.tripo3d.ai/">Tripo3D API</a></li>
</ul>

<h3 id="useful-tools-and-resources">Useful Tools and Resources</h3>

<ul>
  <li><a href="https://amzn.to/3QlMDgk">Learning Blender</a> by Oliver Villar is a good resource to learn the basics of 3D rendering, animation and composition in <a href="https://www.blender.org/">Blender</a></li>
  <li><a href="https://github.com/ultimaker/cura">Cura slicer</a> to turn models into machine paths</li>
  <li>3D printers:
    <ul>
      <li>The cheap but high-quality <a href="https://amzn.to/4dwS1qB">Creality Ender3 V3 SE</a> or the newer <a href="https://amzn.to/4tvoZMQ">Creality Ender3 V3 KE</a>, both very beginner-friendly and easy to repair and tune yourself</li>
      <li>A more costly but way more capable multi filament printer like the <a href="https://amzn.to/4sPUXmV">Creality K2 Plus</a> for multi color prints.</li>
    </ul>
  </li>
  <li>In Austria <a href="https://www.3djake.at/">3DJake</a> is a very good and reliable source for filaments.</li>
</ul>

<h2 id="the-complete-script">The Complete Script</h2>

<p>The complete script is available as a <a href="https://gist.github.com/tspspi/a0dbffa5c95f48224f7100018b88614d">GitHub GIST</a>:</p>

<script src="https://gist.github.com/tspspi/a0dbffa5c95f48224f7100018b88614d.js"></script>]]></content><author><name>tsp</name></author><category term="Programming" /><category term="Python" /><category term="Tutorial" /><category term="3D printing" /><category term="CAD" /><category term="Machine learning" /><category term="How stuff works" /><category term="Automation" /><summary type="html"><![CDATA[Modern generative systems are beginning to reshape how 3D assets are created, lowering the barrier between idea and implementation. While most platforms focus on interactive web interfaces, their real potential emerges when treated as programmable components in an automated pipeline. By combining text or image based generation with structured processing, it becomes possible to create automated, scalable workflows that produce not just individual models, but entire libraries of assets. This article explores how to build such a pipeline using the Tripo3D API, focusing on task-based execution, metadata tracking, deterministic file organization, and flexible export strategies. Rather than replacing traditional CAD or artistic workflows, this approach complements them, bridging the gap between generative models and engineering processes, and turning 3D asset creation into a fully automatable system.]]></summary></entry><entry><title type="html">A Better Approach on Billing for Electricity</title><link href="https://www.tspi.at/2026/04/03/betterebilling.html" rel="alternate" type="text/html" title="A Better Approach on Billing for Electricity" /><published>2026-04-03T00:00:00+02:00</published><updated>2026-04-03T19:10:37+02:00</updated><id>https://www.tspi.at/2026/04/03/betterebilling</id><content type="html" xml:base="https://www.tspi.at/2026/04/03/betterebilling.html"><![CDATA[<ul>
  <li><a href="#introduction">Introduction</a>
    <ul>
      <li><a href="#why-variability-is-expensive">Why variability is expensive</a></li>
    </ul>
  </li>
  <li><a href="#current-payment-models-for-households">Current Payment Models for Households</a></li>
  <li><a href="#proposed-billing-model">Proposed Billing Model</a>
    <ul>
      <li><a href="#optional-higher-order-penalty">Optional Higher-Order Penalty</a></li>
      <li><a href="#the-total-bill">The Total Bill</a></li>
      <li><a href="#discrete-implementation">Discrete Implementation</a></li>
    </ul>
  </li>
  <li><a href="#conclusion">Conclusion</a></li>
  <li><a href="#appendix-how-to-reduce-your-load-variation-in-practice">Appendix: How to Reduce Your Load Variation in Practice</a>
    <ul>
      <li><a href="#avoid-sudden-large-load-steps">Avoid Sudden Large Load Steps</a></li>
      <li><a href="#use-smart-scheduling-for-flexible-loads">Use Smart Scheduling for Flexible Loads</a></li>
      <li><a href="#decouple-consumption-from-instantaneous-demand-buffering">Decouple Consumption from Instantaneous Demand (Buffering)</a></li>
      <li><a href="#limit-high-frequency-switching">Limit High-Frequency Switching</a></li>
      <li><a href="#coordinate-loads-within-the-household">Coordinate Loads Within the Household</a></li>
      <li><a href="#monitoring">Monitoring</a></li>
      <li><a href="#shift-thinking-from-total-energy-to-smooth-power">Shift Thinking from Total Energy to Smooth Power</a></li>
    </ul>
  </li>
  <li><a href="#references">References</a></li>
</ul>

<p><img src="/assets/images/png/powergrid001_small.png" alt="" /></p>

<h2 id="introduction">Introduction</h2>

<p>Electric power systems are not monolithic. They are a layered composition of generation assets, each optimized for a different role in time and variability.</p>

<p><strong>Baseload plants</strong> such as run-of-river, nuclear and coal power plants are designed to operate continuously at or near their nominal output. Their economics are dominated by high capital expenditure and very low marginal costs. These plants achieve their lowest levelized cost of electricity when running steadily. Any deviation, like ramping up and down, partial loading or cycling, reduces thermodynamic efficiency, increases mechanical stress and raises the effective cost per kWh.</p>

<p><strong>Load following plants</strong> like combined cycle gas turbines and reservoir hydro can adjust output on timescales of minutes to hours. They fill the gap between baseload and fast-response resources. Their marginal costs are higher than baseload, but they are far more flexible.</p>

<p><strong>Peaking and balancing resources</strong> like open cycle gas turbines, pumped hydro storage and batteries are designed for rapid response on time scales of seconds to minutes (which cannot be achieved by load-following or baseload plants). They are essential for:</p>

<ul>
  <li>Frequency regulation</li>
  <li>Contingency reserves (i.e. responding to sudden outages or sudden demands)</li>
  <li>Intra day balancing</li>
</ul>

<p>These resources have the highest marginal cost and often low utilization. Their costs must be covered by fewer operating hours which means they are the most expensive resources on the grid, but they cannot be replaced by baseload or load following plants.</p>

<p>The timescales the grid operates on can be separated into four categories, which roughly divide into a technical stage (primary and secondary control) and an economic stage (tertiary control and day-ahead or intraday markets):</p>

<ul>
  <li>The <em>primary control</em> happens on the scale of seconds. This is automatic frequency response and requires extremely fast ramping resources (network capacity, open cycle gas turbines, batteries)</li>
  <li>The <em>secondary control</em> happens on the scale of tens of seconds to minutes. Here centralized control restores frequency and balances control areas. This still requires fast ramping resources (open cycle gas turbines, batteries, and partially already combined cycle gas turbines and reservoir hydro).</li>
  <li><em>Tertiary control</em> happens on the scale of minutes to hours and handles economic dispatch and reserve replacement.</li>
  <li><em>Day-ahead</em> and <em>intraday markets</em> operate on the scale of hours to days. Here generation is scheduled and planned via economic rules (comparable to a stock market).</li>
</ul>

<h3 id="why-variability-is-expensive">Why variability is expensive</h3>

<p>For many people, energy cost is determined only by the total energy (in kWh) consumed. But just as important is the shape of the load curve. A perfectly constant load can be served almost entirely by baseload generation. A highly variable load, on the other hand, requires more reserves held online, more ramping of thermal units, increased cycling (and thus wear and maintenance costs) as well as higher reliance on fast-response (expensive) assets. Rapid household-level switching aggregates across millions of users into steep net-load ramps. This leads to transformer and distribution stress (wear and infrastructure cost), voltage deviations and increased reserve requirements. These integration and balancing costs add up massively at higher variability levels[<a href="#ref1">1</a>]. The same total energy volume can cost a retailer far more[<a href="#ref2">2</a>] if the load shape is spiky and variable, due to hedging costs, peak procurement and lost opportunities to use low-cost periods.</p>

<p>Frequency regulation markets see price spikes with higher variability or renewable penetration. For example, increasing wind from low penetration to 30% can raise regulation prices by 32%, and doubling regulation requirements can increase them by 84%. Flexible resources (hydro, batteries) help mitigate this, but the baseline cost of fast response is clearly elevated compared to steady operation[<a href="#ref3">3</a>]. Smoothing or shifting load (i.e. the opposite of rapid switching) lowers system costs by reducing peak demand, deferring infrastructure and dampening price volatility. This can cut operational expenses, improve reliability and avoid billions in distribution upgrades[<a href="#ref4">4</a>].</p>

<p><strong>Variable residential profiles are far more expensive to serve than steady commercial and industrial ones</strong>. In addition, the variability injected by renewables like solar and wind incurs the same kind of cost, and is the dominant cause of load variations at this point in time. Load variability also compresses grid margins through tighter reserves and more cycling of equipment[<a href="#ref2">2</a>].</p>

<h2 id="current-payment-models-for-households">Current Payment Models for Households</h2>

<p>Today, most residential billing is based almost exclusively on total energy consumption:</p>

[
\begin{aligned}
    E = \int_0^T P(t) \mathrm{d}t
\end{aligned}
]

<p>This model implicitly assumes that all kWh are equal, regardless of when or how they are consumed. In reality, however, a kWh drawn during a stable low-demand period can be extremely cheap due to the very low marginal cost of baseload energy. The real system cost driver for households is the variability they inject (suddenly powering an EV charger, a heat pump, an air conditioner or an induction stove on or off, plugging or unplugging chargers, switching devices on and off, etc.), which forces the grid operator to deploy fast-response resources (like gas peaking supply, pumped hydro storage, hydrogen storage and batteries) that are far more expensive per kWh than baseload.</p>

<p>Some tariffs already introduce demand charges:</p>

[
P_\mathrm{max} = \max_{t} P(t)
]

<p>However, this only captures the maximum level, not the dynamics that cause the bulk of the real costs.</p>

<p>Modern smart meters already sample power at intervals between 1 and 60 seconds (while only reporting on timescales of around 15 minutes). Therefore, the <em>temporal structure</em> of the consumption is already observable without changes to infrastructure; it is just not used in billing.</p>

<h2 id="proposed-billing-model">Proposed Billing Model</h2>

<p>In the following section we extend the billing model to include not only total energy but also the <em>temporal variation</em> of the load.</p>

<p>Let</p>

<ul>
  <li>$P(t)$: instantaneous power (kW)</li>
  <li>$T$: billing interval (for example $T=720h$ for one month)</li>
</ul>

<p>Then the model includes two key quantities:</p>

<p>The first is, as in the traditional model, the <strong>energy consumption</strong>. Its rate can be kept very small or even zero, reflecting the very low marginal cost of steady generation:</p>

[
E = \int_0^T P(t) \mathrm{d}t
]

<p>The second term is the <strong>load variation</strong>:</p>

[
V = \int_0^T \mid \frac{\mathrm{d}P}{\mathrm{d}t} \mid \mathrm{d}t
]

<p>This is the <em>total variation</em> of $P(t)$. Every time one turns a 2 kW load on and off again, $V$ increases by 4 kW, regardless of how long it was on. Rapid cycling drives $V$ up dramatically, while a steady load barely moves it. This term is <em>independent of duration</em> and <strong>reflects how aggressively the grid is stressed</strong>.</p>
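
<p>The duration independence is easy to verify numerically. A minimal sketch of the discrete total variation of a uniformly sampled power trace (values in kW):</p>

```python
def total_variation(samples):
    # Discrete total variation: sum of absolute sample-to-sample
    # changes of the power trace, in kW. Independent of duration.
    return sum(abs(b - a) for a, b in zip(samples, samples[1:]))

# A 2 kW load switched on and off contributes 2 + 2 = 4 kW,
# regardless of how long it stays on:
assert total_variation([0, 2, 0]) == 4
assert total_variation([0, 2, 2, 2, 2, 2, 0]) == 4

# Rapid cycling of the same load drives V up dramatically:
assert total_variation([0, 2] * 10 + [0]) == 40  # ten on/off cycles
```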

<p>An even more physical, but slightly more complicated, model would incorporate the frequency response of the system (i.e. provide a <em>frequency cost</em>):</p>

[
V_\mathrm{phys} = \int_0^T \mid \mathfrak{H}(\omega) P(\omega) \mid^2 \mathrm{d} \omega
]

<h3 id="optional-higher-order-penalty">Optional Higher-Order Penalty</h3>

<p>To penalize rapid fluctuations even more strongly, high-pass filtering and the square of the derivative can be used:</p>

[
V = \int_0^T \left( \frac{\mathrm{d}P}{\mathrm{d}t} \right)^2 \mathrm{d}t
]

<p>Such a term would in particular hit high frequency switching devices without proper filtering.</p>

<h3 id="the-total-bill">The Total Bill</h3>

<p>The monthly total bill $B$ is given by</p>

[
B = c_E E + c_V V + c_D \max_{t} P(t) + F
]

<p>The factors are given by:</p>

<ul>
  <li>$c_E$: The energy price, which could be set very low (even in the range of 1 cent per kWh).</li>
  <li>$c_V$: The variation rate charged on load swings. This layer recovers the balancing and infrastructure costs and has to be tuned so the utility covers its fixed and control costs.</li>
  <li>$c_D$ is an optional demand rate charging for the maximum available power. I personally would not introduce this.</li>
  <li>$F$ is a fixed customer charge handling the provisioning of metering infrastructure, the paperwork and support availability.</li>
</ul>

<h3 id="discrete-implementation">Discrete Implementation</h3>

<p>Due to the discrete sampling of the smart meters in units of $\Delta t$ the real implementation would of course be discrete:</p>

[
V \propto \sum_{i=1}^N \mid P_i - P_{i-1} \mid
]
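
<p>In code, the discrete billing computation could be sketched as follows. All rate constants here are illustrative placeholders, not proposed tariff values:</p>

```python
def monthly_bill(samples_kw, dt_s, c_e=0.01, c_v=0.05, c_d=0.0, fixed=5.0):
    # B = c_E * E + c_V * V + c_D * max(P) + F from uniformly sampled
    # power readings (kW) taken every dt_s seconds. The rate constants
    # are illustrative placeholders.
    hours = dt_s / 3600.0
    energy_kwh = sum(samples_kw) * hours                  # E = integral of P dt
    variation_kw = sum(abs(b - a)                         # V = total variation
                       for a, b in zip(samples_kw, samples_kw[1:]))
    peak_kw = max(samples_kw)
    return c_e * energy_kwh + c_v * variation_kw + c_d * peak_kw + fixed

# Same total energy (1 kWh), very different shape:
steady = [1.0] * 3600        # 1 kW for one hour, sampled every second
spiky  = [2.0, 0.0] * 1800   # 2 kW bursts averaging 1 kW
assert monthly_bill(steady, 1) < monthly_bill(spiky, 1)
```

Under this scheme the steady profile pays essentially only the energy and fixed charges, while the spiky one accumulates a large variation term despite consuming exactly the same energy.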

<h2 id="conclusion">Conclusion</h2>

<p>This system could be implemented easily with existing smart-meter infrastructure available in many parts of Europe and would give the customer a clear financial incentive to smooth their load. Such a system would provide behavioural incentives reflecting the real cost structure:</p>

<ul>
  <li><strong>Encouraged behaviour</strong>:
    <ul>
      <li>Slow ramping of loads (soft starts, no abrupt switching of large loads, etc.)</li>
      <li>Buffering via batteries or thermal storage</li>
      <li>Scheduling of flexible loads</li>
    </ul>
  </li>
  <li><strong>Discouraged behaviour</strong>:
    <ul>
      <li>Rapid on/off cycling of loads</li>
      <li>Synchronized switching across households</li>
      <li>High frequency load fluctuations</li>
      <li>Non-conforming devices leaking high-frequency switching noise into the grid</li>
    </ul>
  </li>
</ul>

<p>Industrial consumers already face complex tariffs reflecting demand peaks, power factor and time-of-use. Residential users currently do not, despite contributing significantly to variability. This model <strong>aligns individual cost with system impact</strong>, making pricing more economically efficient and fair. Importantly this model is <strong>immediately implementable</strong> using existing infrastructure.</p>

<h2 id="appendix-how-to-reduce-your-load-variation-in-practice">Appendix: How to Reduce Your Load Variation in Practice</h2>

<p>If billing starts to reflect not only <em>how much energy</em> is consumed but also <em>how it is consumed</em>, then reducing load variation becomes both economically and technically meaningful. Fortunately, many strategies are already available with relatively simple tools.</p>

<h3 id="avoid-sudden-large-load-steps">Avoid Sudden Large Load Steps</h3>

<p>The main driver of load variation is not energy usage itself, but rapid changes in power. Typical problematic patterns include:</p>

<ul>
  <li>Switching high-power devices abruptly (like EV chargers, heaters, induction stoves, etc.)</li>
  <li>Multiple devices turning on simultaneously</li>
  <li>Thermostatic systems oscillating between full on and off</li>
</ul>

<p>Whenever possible:</p>

<ul>
  <li>Prefer devices with soft-start or ramping behaviour</li>
  <li>Avoid manually switching multiple high loads at once</li>
  <li>Stagger the activation of large consumers by a few seconds</li>
</ul>

<p>Even small delays between devices can significantly reduce aggregated grid stress.</p>
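
<p>Note that staggering two identical loads does not change the plain total variation $V$, but it does reduce the optional squared-derivative penalty introduced earlier, which is exactly the term targeting steep simultaneous steps. A small sketch:</p>

```python
def tv(samples):
    # Plain total variation (kW): insensitive to how steep a step is.
    return sum(abs(b - a) for a, b in zip(samples, samples[1:]))

def tv_sq(samples):
    # Squared-difference penalty (proportional to the integral of
    # (dP/dt)^2 for a fixed sampling interval): punishes one large
    # simultaneous step more than several staggered smaller ones.
    return sum((b - a) ** 2 for a, b in zip(samples, samples[1:]))

# Two 2 kW devices, switched together vs. a few samples apart:
together  = [0, 4, 4, 4, 0]
staggered = [0, 2, 4, 4, 2, 0]

assert tv(together) == tv(staggered) == 8    # same plain variation
assert tv_sq(staggered) < tv_sq(together)    # 16 < 32: staggering pays off
```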

<h3 id="use-smart-scheduling-for-flexible-loads">Use Smart Scheduling for Flexible Loads</h3>

<p>Here it gets more interesting and technical. Some loads are inherently flexible in time:</p>

<ul>
  <li>EV charging</li>
  <li>Dishwashers</li>
  <li>Washing machines</li>
  <li>Water heating</li>
</ul>

<p>Instead of starting them immediately, shift them to periods where your total load is already stable. A simple strategy is:</p>

<ul>
  <li>Only start new loads when current consumption is low and stable</li>
  <li>Avoid starting devices during existing ramps (e.g. while heating systems are activating)</li>
</ul>

<p>Home automation systems can implement this with simple rules.</p>
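
<p>As a sketch of such a rule, a deferred load could be released only when the recent grid draw is both low and flat (all thresholds below are illustrative):</p>

```python
def can_start_load(recent_kw, low_kw=1.0, max_swing_kw=0.3):
    # Gate for a deferred flexible load: start only when the recent
    # power trace is both low and stable. Thresholds are illustrative.
    return (max(recent_kw) <= low_kw and
            max(recent_kw) - min(recent_kw) <= max_swing_kw)

assert can_start_load([0.4, 0.5, 0.45])      # low and stable: start
assert not can_start_load([0.4, 2.5, 0.6])   # mid-ramp: wait
assert not can_start_load([1.4, 1.5, 1.45])  # stable but not low: wait
```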

<h3 id="decouple-consumption-from-instantaneous-demand-buffering">Decouple Consumption from Instantaneous Demand (Buffering)</h3>

<p>This could be done in various ways:</p>

<ul>
  <li><strong>Electrical buffering</strong>: Small home batteries can absorb rapid changes, even modest capacities (1-5 kWh) are sufficient to smooth short spikes. Charging and discharging can be controlled to keep grid-facing power nearly constant.</li>
  <li><strong>Thermal buffering</strong>: Heat pumps with buffer tanks, and water boilers, store thermal energy. These systems can run at constant power and store energy for later use, transforming a spiky load into a smooth baseline.</li>
</ul>
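
<p>The electrical-buffering idea can be illustrated with a toy controller that draws a constant target power from the grid and lets the battery absorb the difference (lossless and rate-unlimited, with illustrative numbers):</p>

```python
def smooth_with_battery(load_kw, dt_h, target_kw, capacity_kwh, soc_kwh=0.0):
    # Toy controller: the grid should supply a constant target power;
    # the battery charges on surplus and discharges on deficit while
    # its state of charge allows. Lossless, rate-unlimited sketch.
    grid = []
    for p in load_kw:
        delta_kwh = (target_kw - p) * dt_h
        new_soc = min(max(soc_kwh + delta_kwh, 0.0), capacity_kwh)
        # Whatever the battery could not absorb or supply hits the grid:
        grid.append(p + (new_soc - soc_kwh) / dt_h)
        soc_kwh = new_soc
    return grid

# A spiky 0/3 kW load (average 1.5 kW), sampled every 0.1 h, becomes a
# flat 1.5 kW grid draw with even a small battery:
grid = smooth_with_battery([0.0, 3.0] * 4, 0.1, target_kw=1.5, capacity_kwh=1.0)
assert all(abs(g - 1.5) < 1e-9 for g in grid)
```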

<h3 id="limit-high-frequency-switching">Limit High-Frequency Switching</h3>

<p>Many modern devices (especially cheap power electronics) introduce rapid fluctuations. These include poorly filtered switching power supplies, fast PWM-controlled heaters and cheap motor controllers. While individually small, these effects can accumulate. To mitigate them, prefer higher-quality devices with proper filtering, add input filtering (LC filters) where applicable and avoid unnecessarily rapid on/off control loops. In a billing model sensitive to variation, these small fluctuations may become economically visible.</p>

<h3 id="coordinate-loads-within-the-household">Coordinate Loads Within the Household</h3>

<p>A household is not a single device but a combination of dynamic systems. With coordination, the total load variation can be reduced by ramping devices up and down in concert, keeping the total power approximately constant. Coordinating devices such as EV chargers, heating or washing machines in particular can drastically reduce load variations.</p>

<h3 id="monitoring">Monitoring</h3>

<p>Without monitoring one never knows what to optimize - or whether there is anything to optimize at all. Monitoring can be done very easily via</p>

<ul>
  <li><a href="https://amzn.to/3OmwqqF">Digitally readable smart meters</a> providing time resolved readings</li>
  <li><a href="https://amzn.to/4sQ8YB4">Energy monitors</a> providing the total consumed energy</li>
  <li>Logging systems based on digitally readable smart meters and switch status of home automation systems.</li>
</ul>

<p>This allows one to visualize the load profile $P(t)$ and the rate of load changes $\Delta P(t)$. When looking at the data, inefficiencies often become immediately obvious.</p>
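<p>Deriving the discrete rate of load change from such a log is straightforward; the sample data and the ramp threshold below are invented for illustration:</p>

```python
# Sketch: compute a discrete ΔP(t) from logged power samples and flag
# steep ramps. Sample values and the 1000 W threshold are illustrative.

def load_changes(samples_w):
    """Successive differences of a power log, i.e. a discrete ΔP(t)."""
    return [b - a for a, b in zip(samples_w, samples_w[1:])]

log = [500, 520, 2600, 2580, 700]             # W, one sample per interval
deltas = load_changes(log)
print(deltas)                                  # [20, 2080, -20, -1880]
ramps = [d for d in deltas if abs(d) > 1000]   # the expensive transitions
print(ramps)                                   # [2080, -1880]
```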

<h3 id="shift-thinking-from-total-energy-to-smooth-power">Shift Thinking from Total Energy to Smooth Power</h3>

<p>This is most likely the hardest one, since most of us learnt at school to turn off unneeded devices to conserve energy. The useful mental model is that the grid prefers a <em>constant request</em> over multiple small <em>bursty ones</em>. This shift in thinking would lead to fewer peaks, less cycling and lower system cost.</p>

<h2 id="references">References</h2>

<ul>
  <li>[<span id="ref1">1</span>] <a href="https://www.irena.org/-/media/Files/IRENA/Agency/Publication/2015/IRENA_Baseload_to_Peak_2015.pdf">From baseload to peak: Renewables provide a reliable solution</a></li>
  <li>[<span id="ref2">2</span>] <a href="https://www.gridcog.com/blog/volume-and-shape-understanding-the-cost-of-electricity-and-the-value-of-flexibility">Volume and Shape: Understanding the cost of electricity and the value of flexibility</a></li>
  <li>[<span id="ref3">3</span>] <a href="https://www.anl.gov/esia/prices-in-frequency-regulation-markets-impacts-of-natural-gas-prices-and-variable-renewable-energy">Prices in Frequency Regulation Markets: Impacts of Natural Gas Prices and Variable Renewable Energy</a></li>
  <li>[<span id="ref4">4</span>] <a href="https://www.esig.energy/demand-response-can-play-a-vital-role-in-ensuring-grid-reliability-and-dampening-price-volatility-in-wholesale-electricity-markets/">Demand response can play a vital role in ensuring grid reliability and dampening price volatility in wholesale electricity markets</a></li>
</ul>

<p><img src="/assets/images/png/vampire_electrician001.png" alt="" /></p>]]></content><author><name>tsp</name></author><category term="Opinion" /><category term="Basics" /><category term="Power grid" /><category term="Finance" /><summary type="html"><![CDATA[Rethinking electricity billing by aligning costs with load variability rather than just total consumption would allow a better distribution of caused costs and provide an incentive to reduce the strain on the network by reducing load variations.]]></summary></entry><entry><title type="html">Using Codex with Hardware In The Loop for Microcontrollers</title><link href="https://www.tspi.at/2026/03/24/hardwareloop.html" rel="alternate" type="text/html" title="Using Codex with Hardware In The Loop for Microcontrollers" /><published>2026-03-24T00:00:00+01:00</published><updated>2026-03-29T01:25:16+01:00</updated><id>https://www.tspi.at/2026/03/24/hardwareloop</id><content type="html" xml:base="https://www.tspi.at/2026/03/24/hardwareloop.html"><![CDATA[<p>When people talk about <em>“AI pair programmers”</em> they usually picture yet another autocomplete window. In this blog article I treated Codex as an embedded teammate with full access to a ModBus target bench: a single <a href="https://amzn.to/4sFikj5"><code class="language-plaintext highlighter-rouge">ATmega2560</code></a> fitted with a <a href="https://amzn.to/4bHhKtR"><code class="language-plaintext highlighter-rouge">MAX485</code></a> transceiver flashable via its main serial port and the Arduino bootloader, accessed on it’s secondary <code class="language-plaintext highlighter-rouge">UART</code> through a <a href="https://amzn.to/4bOt6wj">USB serial-to-RS485 adapter</a> Because <code class="language-plaintext highlighter-rouge">Codex</code> can compile, flash, and interrogate that modest setup as part of its inner loop, we ended up with a development cadence that felt like test-driven hardware bring-up instead of the usual edit-build-burn cycle.</p>

<p>This article very briefly describes a flow that was actually used this way, yielding a <a href="https://github.com/tspspi/avrModBus">workable library</a>; it is neither sugar-coated nor exaggerated. Keep in mind this was a small-scale project, so it was a very easy task for the agent to perform. Note that the conversations with the agent are trimmed down; the shown snippets should just provide a rough idea.</p>

<ul>
  <li><a href="#bootstrapping-the-collaboration-and-designing-the-architecture">Bootstrapping the collaboration and designing the architecture</a></li>
  <li><a href="#iterating-with-real-silicon-in-the-loop">Iterating with real silicon in the loop</a></li>
  <li><a href="#debug-automation-without-babysitting">Debug automation without babysitting</a></li>
  <li><a href="#future-proofing-with-formal-methods">Future-proofing with formal methods</a></li>
  <li><a href="#what-are-the-benefits-of-hardware-in-the-loop">What are the benefits of hardware in the loop</a></li>
  <li><a href="#references">References</a></li>
</ul>

<p><img src="/assets/images/png/vibecode001.png" alt="" /></p>

<h2 id="bootstrapping-the-collaboration-and-designing-the-architecture">Bootstrapping the collaboration and designing the architecture</h2>

<p>We started by writing <code class="language-plaintext highlighter-rouge">AGENTS.md</code> as an executable contract. It spells out that <code class="language-plaintext highlighter-rouge">Codex</code> must keep <code class="language-plaintext highlighter-rouge">Timer0</code> free unless an optional <code class="language-plaintext highlighter-rouge">sysclock</code> is enabled, ship <code class="language-plaintext highlighter-rouge">UART</code> ISRs with ring buffers, and update every artifact (<code class="language-plaintext highlighter-rouge">AGENTS</code>, <code class="language-plaintext highlighter-rouge">DESIGN_DOCUMENT</code>, <code class="language-plaintext highlighter-rouge">TODO</code>, user docs) whenever reality changes. That file also pins the toolchain (<code class="language-plaintext highlighter-rouge">avr-gcc</code>/<code class="language-plaintext highlighter-rouge">avr-libc</code>/<code class="language-plaintext highlighter-rouge">binutils</code>/<code class="language-plaintext highlighter-rouge">avrdude</code>/GNU make via <code class="language-plaintext highlighter-rouge">gmake</code>), the target MCUs (<code class="language-plaintext highlighter-rouge">ATmega328P</code> utilizing <code class="language-plaintext highlighter-rouge">UART0</code> and <code class="language-plaintext highlighter-rouge">ATmega2560</code> utilizing UART0 and UART1 even though only the 2560 rig was in the loop for now), and communication habits (line-referenced file mentions, short status updates, immediate blocker escalation). Having that behavior encoded up front was the equivalent of onboarding a senior engineer in writing: every later decision referenced back to it, and Codex kept it fresh whenever constraints shifted.</p>

<blockquote>
  <p>“You are a design architect and software developer. We are going to implement a ModBus Slave on an ATMega microcontroller. First we are creating a design document till all open questions are resolved and we have specified all technical details. We do this in a back and forth conversation, you do not take decisions. Present open questions and provide suggestions. The user decides which decisions to take. You do not decide yourself. Follow the design rules from DESIGNRULES.md when writing the architecture document. In the first stage we are writing <code class="language-plaintext highlighter-rouge">docs/DESIGN_DOCUMENT.md</code> as a detailed technical design document and <code class="language-plaintext highlighter-rouge">docs/TODO.md</code> as an [ ] open, [x] done, [-] rejected ToDo list that you keep up to date all the time. After we finished designing you are going to implement the project according to the ToDo list using avr-gcc, avr-libc, binutils and avrdude. You build using gmake. Use no other tools. You can flash the program to the microcontroller using <code class="language-plaintext highlighter-rouge">gmake flash</code> and access the serial port via <code class="language-plaintext highlighter-rouge">/dev/ttyU0</code> as well as the RS485 bus on <code class="language-plaintext highlighter-rouge">/dev/ttyU1</code>. First draft your <code class="language-plaintext highlighter-rouge">AGENTS.md</code> that explains your role. Only ever edit <code class="language-plaintext highlighter-rouge">~/projectdirectory</code>.”</p>
</blockquote>

<p>With the agent contract in place we drafted <code class="language-plaintext highlighter-rouge">docs/DESIGN_DOCUMENT.md</code>. Codex drove that conversation like an architecture review. It enumerated which registers must be memory-backed, how MAX485 control pins are abstracted, what the ISR boundary looks like, and even future knobs (<code class="language-plaintext highlighter-rouge">Timer0</code> vs <code class="language-plaintext highlighter-rouge">Timer1</code> tick sources). Whenever ambiguity popped up - How should holding register 255 trigger EEPROM commits? Should UUIDs live in callbacks or static blocks? - Codex paused, listed options, and asked for confirmation before touching code. That high-velocity Q&amp;A mirrored how a human architect would unblock a team, just without the context loss that happens when humans juggle too many requirements.</p>

<blockquote>
  <p>“Start writing the ModBus RTU slave architecture for ATmega2560 + MAX485 attached to UART1. Document UART buffers, gap timing and the reset behavior before you write firmware. Also honor proper timeout processing. We support input registers, coil outputs, holding registers and output registers. We need write single, write multiple, read single and read multiple commands for those registers. We implement a set of fixed registers for the device address (…), baud rate (…) as well as an UUID based identity register to identity the device and the firmware. Later we are also going to implement an RS485 capable bootloader so we are able to flash the device via the RS485 bus (this will be an independent project)”</p>
</blockquote>

<p>To keep implementation straight we used <code class="language-plaintext highlighter-rouge">docs/TODO.md</code> as both a kanban lane and a verification ledger. Items ranged from API scaffolding to tests with the single board RS485 link. The checklist style (explicit <code class="language-plaintext highlighter-rouge">[ ]</code> vs <code class="language-plaintext highlighter-rouge">[x]</code>) made it trivial to see which capabilities still needed either implementation or bench validation. Parallel to that we maintained <code class="language-plaintext highlighter-rouge">KB/index.md</code>, a placeholder knowledge base meant for any external ModBus or AVR timing references Codex might have had to fetch. Even when the KB stayed empty, the scaffolding reminded us that Codex could, on demand, go out to public documentation, store PDFs or markdown summaries under <code class="language-plaintext highlighter-rouge">KB/</code>, and cite them later (this was left out in the presented instructions above).</p>

<p>This is a pretty standard approach to using a coding agent. One will usually spend between an hour and half a day writing a design document this way, depending on the project scope, talking back and forth with the agent to resolve open questions, discuss feasibility as well as pros and cons, and take decisions. This phase feels like the meetings with human engineers during the design and architecture phase, though it is far more productive and involves less friction and social stress. And in contrast to the human world, dumb ideas always get harsh feedback.</p>

<p><img src="/assets/images/png/vibecode003.png" alt="" /></p>

<h2 id="iterating-with-real-silicon-in-the-loop">Iterating with real silicon in the loop</h2>

<p>Once the architecture felt solid Codex shifted into coding mode. It produced the UART/RS-485 hardware layer, the ModBus core parser, and the register handlers in digestible steps, performing unit tests as it went. It always followed the same pattern: update the TODO, write code, run <code class="language-plaintext highlighter-rouge">gmake</code>, run unit tests, flash the ATmega2560, and immediately exercise register reads and writes over the physical bus via on-the-fly written <code class="language-plaintext highlighter-rouge">pyserial</code> scripts. Because the MAX485 driver enable lines and UART ISRs were part of the same repo, Codex could inject temporary instrumentation (extra GPIO toggles, debug prints gated behind <code class="language-plaintext highlighter-rouge">#ifdef MODBUS_DEBUG</code>, CRC probes, etc.) without breaking the contract, test the hypothesis on hardware, and then strip the probes again - all inside a single loop. This resembled the <code class="language-plaintext highlighter-rouge">printf</code>-style debugging that junior engineers often use.</p>

<blockquote>
  <p>“Perform a sequence of reads and writes into and from the registers and dump debug messages on the AVRs serial port. Interact with the device via the RS485 bus and inspect validity of the reaction on the serial port. Create valid and invalid requests.”</p>
</blockquote>
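<p>The on-the-fly scripts themselves were throwaway artifacts, but a minimal sketch of what such a script computes might look as follows. The frame layout and the CRC16 polynomial (0xA001) come from the ModBus RTU standard; the slave address, register range and function names are illustrative, and actual transmission over <code class="language-plaintext highlighter-rouge">pyserial</code> is omitted:</p>

```python
import struct

# Sketch of the kind of throwaway bench script: build a ModBus RTU
# "read holding registers" (function 0x03) request frame.

def crc16_modbus(data: bytes) -> int:
    """Standard ModBus CRC16: init 0xFFFF, reflected poly 0xA001."""
    crc = 0xFFFF
    for byte in data:
        crc ^= byte
        for _ in range(8):
            crc = (crc >> 1) ^ 0xA001 if crc & 1 else crc >> 1
    return crc

def read_holding_registers(slave, first_reg, count):
    pdu = struct.pack(">BBHH", slave, 0x03, first_reg, count)
    crc = crc16_modbus(pdu)
    return pdu + struct.pack("<H", crc)    # CRC is sent low byte first

frame = read_holding_registers(0x01, 0x0000, 10)
print(frame.hex(" "))                      # 01 03 00 00 00 0a c5 cd
```

Feeding deliberately corrupted frames (flip one CRC byte) is exactly how the invalid-request paths mentioned in the prompt above were exercised.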

<p>A concrete example came up when we noticed that writing a new device ID took effect immediately, which violates ModBus expectations. Codex traced the bug by flashing diagnostic builds that logged both the pending and active IDs after every frame. It then restructured <code class="language-plaintext highlighter-rouge">modbus_core.c</code> to separate <code class="language-plaintext highlighter-rouge">PendingConfig</code> from <code class="language-plaintext highlighter-rouge">ActiveDeviceId</code>, staged new values in RAM only, and confirmed on the rig that the slave still answered on the old ID until the reset magic <code class="language-plaintext highlighter-rouge">0xAA55</code> forced a reboot. That entire investigation - code change, compilation, flashing, scripted ModBus transactions, and regression verification - ran autonomously while we observed the terminal output.</p>

<blockquote>
  <p>“I see a device ID bug. The device seems to automatically apply directly after writing into the respective register. Reproduce the bug, capture pending vs active IDs over UART, and refactor to fix so we only apply the change after rebooting the device via reboot magic.”</p>
</blockquote>

<p>Because hardware was always in the loop Codex could also stress scenarios that usually wait for the lab: half-duplex turnaround timing, bursts, deliberate line silence to test the 1.5/3.5 character gap watchdog, and watchdog-induced resets. Whenever a test exposed a weakness, Codex modified the source (sometimes inserting extra assertions or statistics counters), rebuilt, and reran the scenario minutes later. There was no <em>“hand code to a human to flash”</em> delay, so iteration speed approached software-only TDD despite touching real silicon.</p>
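<p>For reference, the inter-frame gap arithmetic those silence tests exercise is easy to sketch. The helper below is hypothetical; the 11-bit character (start bit, 8 data bits, parity, stop bit) and the fixed 750/1750 µs values above 19200 baud come from the ModBus RTU specification:</p>

```python
# ModBus RTU inter-character (t1.5) and inter-frame (t3.5) gaps.
# One RTU character is 11 bits on the wire; above 19200 baud the
# specification fixes the gaps instead of scaling them.

def rtu_gaps_us(baud):
    if baud > 19200:
        return 750.0, 1750.0
    char_us = 11.0 / baud * 1e6            # one 11-bit character, in µs
    return 1.5 * char_us, 3.5 * char_us

t15, t35 = rtu_gaps_us(9600)
print(round(t15), round(t35))              # 1719 4010
```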

<p><img src="/assets/images/png/vibecode002.png" alt="" /></p>

<h2 id="debug-automation-without-babysitting">Debug automation without babysitting</h2>

<p>The workflow never depended on someone manually driving a serial console. Instead we kept lightweight Python and shell utilities around to spray ModBus frames, capture responses, and reset boards via the watchdog harness. Codex could call those scripts, parse their output, and decide on the next change without waiting for human prompts. That made higher-level experiments feasible: for example, sweeping coil write bursts across dozens of registers while monitoring current draw, or verifying that ring buffer overruns stay cleared even when the main loop intentionally starves <code class="language-plaintext highlighter-rouge">ModBusService()</code> for a few milliseconds.</p>

<p>This autonomy extended to documentation and guardrails. Any time the behavior changed, Codex updated <code class="language-plaintext highlighter-rouge">AGENTS</code>, the design doc, <code class="language-plaintext highlighter-rouge">TODO</code>, and the example app notes. It also would have been trivial to pull protocol specs from the public internet, normalize them into <code class="language-plaintext highlighter-rouge">KB/*.md</code>, and cite them inline - handy when juggling RTU timing or EEPROM endurance data. The same mechanism can ingest errata sheets, ModBus application notes or even oscilloscope captures dumped via the <a href="https://github.com/tspspi/pylabdevs">pylabdevs</a> devices, giving future sessions instant context.</p>

<h2 id="future-proofing-with-formal-methods">Future-proofing with formal methods</h2>

<p>One of the underrated perks of this setup is how easily it can grow into formal assurance. The same Codex agent that compiles and flashes code can also emit <code class="language-plaintext highlighter-rouge">ACSL</code> annotations for critical routines. Feed those annotations plus the source into Frama-C, and you gain static guarantees (no runtime errors, preserved invariants) before the bits ever hit flash. Coupling Frama-C proofs with hardware-in-the-loop regression runs lets you blend mathematical confidence with empirical validation, a combination that is usually out of reach for small embedded teams.</p>

<p><img src="/assets/images/png/vibecode004.png" alt="" /></p>

<h2 id="what-are-the-benefits-of-hardware-in-the-loop">What are the benefits of hardware in the loop</h2>

<p>Putting Codex inside the hardware loop changed the economics of firmware work. Instead of queuing questions for a future lab slot, we answered them immediately with the actual boards. Instead of hoping a human remembered every constraint, we encoded expectations in <code class="language-plaintext highlighter-rouge">AGENTS</code> and the design doc so the assistant could enforce them relentlessly. Instead of deferring documentation, we kept the narrative up to date as part of every change. Most importantly, the assistant never tired: it could keep iterating—tweaking ISR latency, adjusting ModBus timing, mutating register maps, or running soak tests—long after a human would have walked away.</p>

<p>If you are building microcontroller firmware with tight loops, shared peripherals or embedded networks, wiring Codex into your hardware bench gives you the confidence of continuous validation with the speed of scripted development. Whether you need a minimally guided debugging partner or a fully autonomous regression runner, the same ingredients apply: define the agent contract, capture the architecture, keep <code class="language-plaintext highlighter-rouge">TODO</code> and <code class="language-plaintext highlighter-rouge">KB</code> artifacts honest, and hand the assistant access to your toolchain plus your boards. The result is a development flow that feels both methodical and fast—exactly what embedded projects need.</p>

<h2 id="references">References</h2>

<ul>
  <li><a href="https://chatgpt.com/codex">OpenAI codex</a></li>
  <li><a href="https://www.tspi.at/2021/02/10/rs485avr.html">RS485 communication using Atmel ATMega328P</a></li>
  <li>Used utilities:
    <ul>
      <li><a href="https://amzn.to/4sFikj5">Arduino Mega 2560</a></li>
      <li><a href="https://amzn.to/4bOt6wj">Waveshare USB to RS485 converter</a></li>
      <li><a href="https://amzn.to/4bHhKtR">MAX485 breakout board</a></li>
      <li><a href="https://amzn.to/4d2yq1l">Hantek 6022BE USB Digital Oscilloscope</a></li>
    </ul>
  </li>
  <li>The <a href="https://github.com/tspspi/avrModBus">resulting ModBus RTU slave toolkit for AVR</a></li>
</ul>

<p><img src="/assets/images/png/vibecode007.png" alt="" /></p>]]></content><author><name>tsp</name></author><category term="Programming" /><category term="Opinion" /><category term="Case study" /><category term="Artificial Intelligence" /><category term="Tutorial" /><category term="Hardware" /><category term="RS485" /><category term="Large Language Models" /><category term="ANSI C" /><category term="Microcontroller" /><category term="AVR" /><category term="Vibe coding" /><category term="ModBus" /><summary type="html"><![CDATA[What happens when an AI does not just suggest code, but actually compiles it, flashes it onto a microcontroller, and tests it against real hardware? In this article we explore a workflow where Codex is wired directly into a ModBus RS485 test bench, turning firmware development into a continuous loop of design, implementation, and validation on real silicon. Instead of the usual edit–build–flash cycle, the system behaves more like a self-driven engineer: asking architectural questions, updating documentation, running tests, and iterating until the behavior matches the specification. Using a concrete ATmega2560 + MAX485 setup, we walk through how design documents, TODO tracking, and an explicit agent contract enable this workflow—and what it feels like to debug firmware when the assistant can probe the system itself. 
From catching subtle protocol violations to restructuring configuration handling in real time, the result is a development process that blends structured engineering with rapid experimentation, all with hardware permanently in the loop.]]></summary></entry><entry><title type="html">LLMs Do Not Remember Facts, They Encode Patterns</title><link href="https://www.tspi.at/2026/03/10/llmsdontremember.html" rel="alternate" type="text/html" title="LLMs Do Not Remember Facts, They Encode Patterns" /><published>2026-03-10T00:00:00+01:00</published><updated>2026-03-10T22:44:48+01:00</updated><id>https://www.tspi.at/2026/03/10/llmsdontremember</id><content type="html" xml:base="https://www.tspi.at/2026/03/10/llmsdontremember.html"><![CDATA[<p>When people talk about large language models (LLMs), they often say that the model <em>“stores knowledge”</em> in its neural network weights. This sounds intuitive and convenient, but it is also deeply misleading. Treating an LLM as if it were a database full of facts leads to confusion about both its capabilities and its limitations.</p>

<p>A much more accurate picture emerges if we stop thinking about LLMs as knowledge containers and instead see them as pattern engines that have learned how ideas, statements, equations, and explanations tend to transform into each other.</p>

<p>To understand why this distinction matters, we need to look at what actually happens when a language model produces an answer.</p>

<ul>
  <li><a href="#why-an-llm-is-not-a-knowledge-database">Why an LLM Is Not a Knowledge Database</a></li>
  <li><a href="#even-scientific-formulas-are-patterns">Even Scientific Formulas Are Patterns</a></li>
  <li><a href="#what-actually-lives-inside-the-model">What Actually Lives Inside the Model</a></li>
  <li><a href="#why-external-knowledge-systems-are-necessary">Why External Knowledge Systems Are Necessary</a>
    <ul>
      <li><a href="#vector-retrieval-finding-similar-text">Vector Retrieval: Finding Similar Text</a></li>
      <li><a href="#graph-retrieval-recovering-structure">Graph Retrieval: Recovering Structure</a></li>
    </ul>
  </li>
  <li><a href="#why-this-matters">Why This Matters</a></li>
</ul>

<p><img src="/assets/images/png/llmsdontlearn001.png" alt="" /></p>

<h2 id="why-an-llm-is-not-a-knowledge-database">Why an LLM Is Not a Knowledge Database</h2>

<p>A classical knowledge system stores information explicitly. A database entry might look like this:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>(country="Austria", capital="Vienna")
</code></pre></div></div>

<p>If we ask the system for the capital of Austria, it simply performs a lookup and returns the stored value.</p>

<p>A language model does something fundamentally different. It does not retrieve a stored record. Instead it predicts the most probable continuation of a text sequence based on statistical patterns learned during training.</p>

<p>At the outermost level, the system produces tokens by sampling from a probability distribution over possible continuations. Mathematically this is often written as</p>

\[
P\left(w_t \mid w_1, w_2, ..., w_{t-1}\right)
\]

<p>which describes how likely a particular token is given the tokens that came before it. During training the model learns this probability distribution by adjusting its internal parameters to minimize a loss function. In practice this is typically the cross-entropy loss between the predicted probability distribution and the actual next token in the training data. Gradient descent is then used to update billions of parameters so that the model gradually becomes better at predicting the next token in a sequence. Over many training iterations this process shapes the internal representations of the network so that useful linguistic and conceptual patterns emerge. The model is therefore not explicitly programmed with rules or facts. Instead its internal structure is optimized purely through exposure to vast amounts of text.</p>
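<p>The training signal described above can be made concrete with a toy example. The vocabulary and probabilities below are invented; real models compute this loss over tens of thousands of sub-word tokens and average it across huge batches:</p>

```python
import math

# Toy illustration of the cross-entropy training signal: the loss is the
# negative log of the probability the model assigned to the token that
# actually followed in the training text.

def cross_entropy(predicted_probs, actual_token):
    return -math.log(predicted_probs[actual_token])

# Invented next-token distribution after "The capital of Austria is"
probs = {"Vienna": 0.90, "Graz": 0.05, "Linz": 0.03, "Salzburg": 0.02}

print(cross_entropy(probs, "Vienna"))  # small loss: confident and correct
print(cross_entropy(probs, "Graz"))    # large loss: drives a big update
```

Gradient descent pushes the parameters in whatever direction shrinks this number, which is the entire sense in which "Vienna" gets encoded: not as a stored record, but as a distribution the network learns to produce.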

<p>However, this formula alone can be misleading. If language models were <em>only</em> performing classical statistical estimation over token frequencies, they would behave much more like sophisticated n-gram models or Bayesian predictors. Such systems <em>can</em> reproduce <em>local</em> statistics, but they cannot <em>generalize</em> well and they cannot discover deeper structures in language.</p>

<p>The crucial difference is the neural network itself. A modern transformer model contains many layers of nonlinear transformations and attention heads that dynamically route information across the sequence. These mechanisms allow the network to detect relationships between words, concepts, and symbolic expressions that may be far apart in the text.</p>

<p>The key mechanism that enables transformers to capture long-range relationships is called <em>attention</em>. Instead of processing tokens strictly one after another like earlier neural networks, the model dynamically decides which parts of the input sequence are relevant when interpreting a particular token. In practice each token generates queries that search for relevant keys among all other tokens in the sequence. The resulting weighted combinations of information allow the model to connect words that may be far apart in the text. This mechanism is what allows modern language models to track references, follow arguments across paragraphs, and relate mathematical symbols to explanations appearing elsewhere in the context.</p>
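<p>A stripped-down, single-head version of this mechanism can be sketched in a few lines. The two-dimensional token vectors below are invented purely for illustration; production models use high-dimensional embeddings, learned projection matrices for queries, keys and values, and many attention heads in parallel:</p>

```python
import math

# Minimal single-head scaled dot-product attention, to illustrate the
# mechanism only. Real transformers learn the query/key/value projections.

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def attention(queries, keys, values):
    d = len(keys[0])
    out = []
    for q in queries:
        # each query scores every key: which tokens matter for this one?
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in keys]
        weights = softmax(scores)           # weights sum to 1 per query
        # the output is a weighted mix of value vectors, so information
        # flows between arbitrarily distant positions in the sequence
        out.append([sum(w * v[i] for w, v in zip(weights, values))
                    for i in range(len(values[0]))])
    return out

toks = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # three invented token vectors
print(attention(toks, toks, toks))
```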

<p>The probability distribution above is therefore only the final sampling interface of the system. Behind it lies a very large nonlinear pattern recognition machine. During training, the neural network learns internal representations that capture regularities in language, mathematics, explanations, and reasoning patterns. Crucially, what the model learns are patterns, not explicit facts. The training process does not insert statements like “Vienna is the capital of Austria” into a memory structure. Instead it adjusts billions of parameters so that certain regions of a very high‑dimensional representation space correspond to recurring conceptual relationships observed in the training data.</p>

<p>When the model answers a question like <em>“What is the capital of Austria?”</em> it does not retrieve Vienna from a memory table. Instead the network transforms the prompt through these learned representations until the sequence of tokens corresponding to the word “Vienna” becomes overwhelmingly likely under the learned patterns. In practice most tokenizers do not even operate on full words, but on sub‑word fragments, so the model is assembling the answer piece by piece according to the patterns it has learned.</p>

<p>The difference might sound subtle, but it has deep consequences. Databases store facts explicitly. Language models instead learn structures in which certain statements naturally follow from certain contexts.</p>

<p><img src="/assets/images/png/llmsdontlearn_dbvspattern.png" alt="" /></p>

<h2 id="even-scientific-formulas-are-patterns">Even Scientific Formulas Are Patterns</h2>

<p>This becomes even clearer if we look at something that appears to be very precise: a physics equation.</p>

<p>Consider the equation</p>

\[
F = \frac{\mathrm{d}p}{\mathrm{d}t}
\]

<p>At first glance it might seem as if the model simply memorized this formula, much like a bad student who has learned a line from a textbook without really understanding what it means. But that interpretation is misleading. The equation itself is not the knowledge. It is only a symbolic representation of a deeper concept: force describes the change of momentum over time.</p>

<p>To see why this matters, it helps to compare two kinds of understanding. A student who memorizes the formula $F = \frac{\mathrm{d}p}{\mathrm{d}t}$ may be able to reproduce the symbols on an exam, but the expression itself is just a sequence of characters to them. A physicist, in contrast, does not think primarily about the letters or the notation. For them the equation activates a much richer conceptual structure.</p>

<p>When a physicist sees this expression, it immediately connects to a broader pattern of how the universe behaves. Ideas about dynamics, momentum, and interaction come into play. In modern physics this also touches deeper principles: symmetries of space and time, conservation laws, and the structures described by symmetry groups. The equation is only one compact way of encoding these relationships. Mathematics is essentially the language we use to describe those patterns precisely.</p>

<p>In other words, the formula is not an isolated statement. It is a symbolic gateway into a network of concepts describing how physical systems evolve.</p>

<p>An LLM learns something somewhat analogous on the linguistic level. It does not store the equation as a static mathematical object. Instead it learns the linguistic and symbolic patterns connecting force, momentum, change, Newtonian dynamics and the notation used to express those relations.</p>

<p>Because these models are trained on an enormous portion of written human knowledge, they are exposed to a vast range of explanations, arguments, analogies, and reasoning styles. What the network therefore absorbs are not individual statements, but recurring patterns of thinking: how humans explain physics, how they derive formulas, how they reason about systems, and how concepts connect to each other. Over time the training process shapes a high‑dimensional representation space that reflects many of the cognitive patterns present in human discourse.</p>

<p>When it writes $F = \frac{\mathrm{d}p}{\mathrm{d}t}$ it is reproducing a learned mapping between these representational forms. The model has internalized the pattern linking them, not the equation as an isolated fact.</p>

<p>This is why language models are surprisingly good at rewriting equations into explanations and explanations back into equations. Because they have learned the structural patterns connecting these representations rather than memorizing individual statements, they can often generalize those patterns to new situations. When faced with a problem they have never seen before, the model can still apply similar reasoning structures it encountered during training, which is why LLMs are sometimes capable of solving entirely new problems that were never explicitly present in their training data.</p>

<p><img src="/assets/images/png/llmsdontlearn001_formulas.png" alt="" /></p>

<h2 id="what-actually-lives-inside-the-model">What Actually Lives Inside the Model</h2>

<p>Inside the neural network there are no explicit facts, rules, or entries. Instead there is a high‑dimensional parameter space that encodes regularities of language, concepts, and symbolic relations. One can think of this space as a vast hyperspace in which related meanings, explanations, equations, and narratives occupy nearby regions. During training the model gradually shapes this hyperspace so that patterns that frequently appear together in human communication become geometrically aligned.</p>

<p>Importantly, the intermediate layers of the neural network learn these structures largely on their own during training. No human explicitly tells the model which internal neurons should represent which concepts. Instead the network discovers useful internal patterns because doing so improves its ability to predict the next token. These internal features therefore do not necessarily correspond to the neat conceptual categories humans might use to organize knowledge. The model may capture many subtle correlations present in the training data — sometimes meaningful conceptual relationships, sometimes statistical associations that humans would not consciously describe.</p>

<p>Modern architectures are also intentionally designed to prevent the model from simply memorizing the training data. Neural networks contain narrow information pathways and compression steps that force the system to represent information efficiently. If the network could simply memorize every sentence it saw, it would fail to generalize. This problem is known as overfitting: the model would reproduce training examples perfectly but perform poorly on new inputs.</p>

<p>Because of these architectural constraints, the model has little choice but to learn reusable patterns instead of storing individual facts. In other words, the structure of the network itself encourages the discovery of general relationships rather than direct memorization.</p>

<p>A useful mental model is that the weights define a transformation landscape. Certain prompts push the internal state of the network into regions where specific continuations become highly probable. If a prompt mentions <em>“capital”</em> and <em>“Austria”</em> the internal representation of the prompt moves into a region of this hyperspace where the continuation corresponding to the word <em>“Vienna”</em> becomes highly probable, most likely activating zones representing cities, capitals, governmental systems, vacations, etc. on the way. But this is not a <em>discrete</em> memory. It is more like an <em>attractor in a probability field</em>.</p>

<p>One of the most striking discoveries in modern AI research is that the capabilities of these models follow relatively predictable scaling laws. As the size of the neural network, the amount of training data, and the available computation increase, the performance of the model improves in a smooth and often surprisingly regular way. Larger models tend to discover richer internal representations and capture increasingly subtle patterns in language and reasoning. At certain scales new capabilities appear that were not obvious in smaller systems. This phenomenon, sometimes described as <a href="/diy/gpusizeestimatellm.html">emergent abilities</a>, is one reason why very large models can perform tasks that smaller models struggle with, even though they are trained with the same fundamental objective of next-token prediction.</p>

<p>The model therefore behaves less like a database and more like a system that has learned how concepts tend to follow each other.</p>

<p><img src="/assets/images/png/llmsdontlearn001_whatsinside.png" alt="" /></p>

<h2 id="why-external-knowledge-systems-are-necessary">Why External Knowledge Systems Are Necessary</h2>

<p>Because LLMs operate through pattern reproduction rather than fact retrieval, they are not ideal sources of authoritative knowledge.</p>

<p>The model can generate extremely plausible statements that were never true in the first place if those statements match the typical reasoning or explanation patterns found in human language. In other words, the model can produce answers that <em>sound exactly like something a knowledgeable human might say</em> even when the underlying statement is incorrect or incomplete.</p>

<p>Interestingly, something very similar happens in human thinking. People sometimes believe they understand a topic because they can reproduce the usual explanation pattern associated with it. Only when they try to verify the statement or derive the result do they discover that their understanding was incomplete. In that sense, the failure mode of LLMs is not entirely foreign - it mirrors a common limitation of human reasoning as well.</p>

<p>For applications where factual accuracy matters, the model therefore needs access to external information sources. This is the motivation behind <strong>Retrieval Augmented Generation (RAG)</strong>.</p>

<p>In a RAG system, the language model does not rely solely on its internal patterns. Instead, it receives relevant documents retrieved from an external knowledge base and reasons over them while generating the answer. The architecture then becomes conceptually simple. A retrieval system finds relevant information, and the language model acts as a reasoning engine that interprets and synthesizes that information.</p>
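<p>As a sketch, that control flow can be reduced to a few lines. The names below (<code class="language-plaintext highlighter-rouge">answerWithRAG</code>, <code class="language-plaintext highlighter-rouge">retrieve</code>, <code class="language-plaintext highlighter-rouge">generate</code>) are hypothetical placeholders, not any specific framework's API; in a real system <code class="language-plaintext highlighter-rouge">retrieve</code> would query a vector store and <code class="language-plaintext highlighter-rouge">generate</code> would call an LLM:</p>

```javascript
// Minimal sketch of the RAG control flow. All names are hypothetical:
// retrieve() stands in for a vector-store query, generate() for an LLM call.
function answerWithRAG(question, retrieve, generate) {
  const documents = retrieve(question);          // 1. find relevant passages
  const prompt = [
    'Answer using ONLY the following sources:',  // 2. ground the model
    ...documents.map((doc, i) => `[${i + 1}] ${doc}`),
    `Question: ${question}`,
  ].join('\n');
  return generate(prompt);                       // 3. let the LLM reason over them
}
```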

<p>This division of labor <em>mirrors how humans work</em>. A scientist does not memorize every paper ever written. Instead they consult references and then reason about the information they find. Humans routinely perform their own form of <em>retrieval augmented reasoning</em>: they look up articles in encyclopedias or on Wikipedia, consult textbooks or lexica, and use formal tools such as mathematics to verify whether a statement is actually correct.</p>

<p>Another remarkable property of large language models is <em>in-context learning</em>. Even though the model's weights remain fixed after training, it can temporarily adapt its behavior based on examples provided directly in the prompt. If a prompt includes several demonstrations of how a task should be performed, the model often continues the pattern correctly for new inputs. In effect the model performs a form of short-term learning inside the context window. The internal representations inferred from the prompt guide the generation process without requiring any permanent update to the model parameters. This ability further illustrates that the model operates by reproducing patterns rather than retrieving stored rules.</p>

<h3 id="vector-retrieval-finding-similar-text">Vector Retrieval: Finding Similar Text</h3>

<p>Most RAG systems rely on vector embeddings to retrieve relevant documents.</p>

<p>Text passages are converted into vectors in a high‑dimensional space. When a user asks a question, the system computes the embedding of the query and searches for passages whose vectors are nearby.</p>

<p>What does “nearby” mean in this context? During training, embedding models learn to place pieces of text that are used in similar contexts close to each other in this space. The geometry of the space therefore begins to encode meaning. Sentences that talk about related ideas tend to end up in neighboring regions, even if they use different words. At the same time, this space often captures stylistic and rhetorical patterns as well. Technical explanations cluster differently from casual descriptions, and scientific writing occupies different regions than narrative text. You can spot the difference in the vector embeddings of <a href="/2025/09/25/simi.html">this blog</a>.</p>

<p>In other words, the high‑dimensional embedding space simultaneously encodes aspects of semantics, style, and conceptual associations. Similarity between two vectors is typically measured using cosine similarity or related metrics, which essentially check whether two vectors point in a similar direction in that space. Performing a nearest neighbor search in this space yields all semantically similar statements inside the knowledge base (I use this, for example, <a href="/2025/09/25/simi.html">for the suggested articles at the bottom of every page</a>).</p>
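<p>As a small illustration of that similarity computation (a toy sketch with made-up three-dimensional vectors, not any particular embedding model's output - real embeddings have hundreds or thousands of dimensions):</p>

```javascript
// Cosine similarity: the dot product of two vectors divided by the product
// of their magnitudes. Values near 1 mean the vectors point in nearly the
// same direction, i.e. the texts occupy nearby regions of the space.
function cosineSimilarity(a, b) {
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Toy 3-dimensional "embeddings" for three sentences:
const viennaCapital = [0.9, 0.1, 0.2];  // "Vienna is the capital of Austria"
const austrianCity  = [0.8, 0.2, 0.1];  // "Vienna is a large Austrian city"
const electronSpin  = [0.1, 0.9, 0.7];  // "Electrons carry a spin of 1/2"

const related   = cosineSimilarity(viennaCapital, austrianCity); // high
const unrelated = cosineSimilarity(viennaCapital, electronSpin); // much lower
```

<p>Related sentences score close to 1, unrelated ones noticeably lower - which is exactly what a nearest neighbor search exploits.</p>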

<p>It is also worth noting that embedding vectors themselves are often generated using transformer models very similar to the LLMs that later consume them. These models learn to map text into this geometric representation so that related meanings occupy nearby regions of the hyperspace.</p>

<p>This approach works very well when the answer is contained in text that is semantically similar to the query.</p>

<p>However, similarity is not the same as structure.</p>

<p>Many difficult questions depend on relationships between entities rather than simple textual similarity.</p>

<p>If the relevant information is spread across multiple documents that describe different parts of a system, vector retrieval may return fragments that are individually related to the question but fail to capture how those fragments connect to each other.</p>

<p>Typical backends are the <code class="language-plaintext highlighter-rouge">pgvector</code> extension for <a href="https://www.postgresql.org/">PostgreSQL</a> or dedicated vector database systems like <a href="https://www.trychroma.com/">ChromaDB</a>.</p>

<p><img src="/assets/images/png/llmsdontlearn_vector001.png" alt="" /></p>

<h3 id="graph-retrieval-recovering-structure">Graph Retrieval: Recovering Structure</h3>

<p>Graph‑based retrieval addresses this limitation by representing knowledge as a network of entities and relationships. Instead of storing only text chunks, the system builds a graph where nodes represent concepts or objects and edges represent relationships such as causation, dependency, or hierarchy. When a query arrives, the system retrieves a relevant subgraph rather than a collection of independent text fragments. This explicit structure makes complex reasoning easier. If the system already knows that</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>component A depends on component B
</code></pre></div></div>

<p>and that</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>component B failed during a redesign
</code></pre></div></div>

<p>then the reasoning path connecting those events is already encoded in the graph.</p>

<p>The language model can then focus on interpreting the structure rather than reconstructing it from scattered prose. The model can traverse such graph structures iteratively: it can follow relationships from one node to the next, interpret the intermediate results and then decide which connections to explore next. By repeating this process over multiple steps, the model can perform multi‑hop reasoning across the graph.</p>
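<p>A toy sketch of such multi‑hop traversal (the graph and the <code class="language-plaintext highlighter-rouge">findPath</code> helper below are illustrative only; a real system would query a graph database):</p>

```javascript
// Tiny in-memory knowledge graph as an adjacency list of labeled edges.
const graph = {
  'Component A': [{ rel: 'depends on', to: 'Component B' }],
  'Component B': [{ rel: 'failed during', to: 'redesign' }],
};

// Depth-first search for a chain of relationships from start to target.
function findPath(start, target, seen = new Set()) {
  if (start === target) return [start];
  seen.add(start);
  for (const edge of (graph[start] || [])) {
    if (seen.has(edge.to)) continue;
    const rest = findPath(edge.to, target, seen);
    if (rest) return [start, `-[${edge.rel}]->`, ...rest];
  }
  return null; // no connecting path found
}
```

<p>Here <code class="language-plaintext highlighter-rouge">findPath('Component A', 'redesign')</code> recovers the two‑hop chain through Component B - the reasoning path is already encoded in the structure.</p>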

<p>The relationships stored in such graphs often resemble what is known in the semantic web world as <a href="https://en.wikipedia.org/wiki/Semantic_triple">RDF triples</a>. These triples represent knowledge as simple subject‑predicate‑object statements, for example:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>("Vienna", "is capital of", "Austria")
("Electron", "has property", "charge")
("Component B", "failed during", "redesign")
</code></pre></div></div>

<p>When many such triples are connected together they form a rich knowledge graph that captures relationships between entities. Graph databases such as <a href="https://neo4j.com/">Neo4j</a> are commonly used to store and query these structures efficiently.</p>

<p>Interestingly, the extraction of these triples from unstructured text is often performed using the same kind of transformer models discussed earlier. LLMs can read documents, identify entities and relationships, and convert them into structured graph representations that can later be used for graph‑based retrieval and reasoning.</p>

<p><img src="/assets/images/png/llmsdontlearn001_graph.png" alt="" /></p>

<h2 id="why-this-matters">Why This Matters</h2>

<p>The deeper lesson is that modern AI systems work best when we separate three roles.</p>

<ul>
  <li>External systems store information.</li>
  <li>Retrieval systems locate relevant pieces of that information.</li>
  <li>Language models reason about the retrieved material and transform it into explanations, summaries, or decisions.</li>
</ul>

<p>When these components are combined, the language model becomes something closer to a cognitive engine operating on a structured information environment. In practice this means that asking a standalone LLM factual questions without any grounding is often the wrong way to use the technology. The model itself is not designed to be the authoritative storage location of knowledge. Its real strength lies in interpreting patterns, combining ideas, performing reasoning steps, and synthesizing information once the relevant data has been supplied by retrieval systems such as RAG or GraphRAG.</p>

<p>Interestingly, the much‑discussed phenomenon of <em>“hallucinations”</em> is closely related to this capability. The same mechanism that allows the model to generate plausible statements beyond its training examples is what enables creativity and generalization. If the system were restricted to reproducing only statements that appeared verbatim in its training data, it would behave like a database lookup and would be incapable of solving new problems or combining ideas in novel ways.</p>

<p>In that sense hallucinations are not purely a bug, they are a side effect of the very property that makes these systems powerful. When they appear problematic, it is often a sign that the system is being used without proper grounding. Once external retrieval systems provide the factual information and the LLM is used primarily for reasoning and interpretation, the architecture begins to resemble a much more robust cognitive system.</p>

<p>This same mechanism can deliberately be used as a feature. Because the model can generate plausible variations and speculative ideas, it can be used to explore creative solution spaces. If the system is connected to verification tools - for example a mathematical <a href="/2026/01/08/proofllm.html">proof assistant</a>, a symbolic solver, or a simulation toolkit - the model can propose candidate ideas while the external tool checks whether they are actually correct. In this way hallucination becomes a generator of hypotheses while external tools provide validation. This pattern is increasingly used in research systems where LLMs propose conjectures, derive candidate formulas, or sketch solution paths which are then verified automatically.</p>

<p>It is therefore helpful to think of the LLM as a machine that has learned how ideas move - how arguments unfold, how explanations are constructed, and how pieces of knowledge connect to each other. And once this reasoning engine is connected to reliable sources of information and verification tools, its ability to analyze, explore, and synthesize knowledge becomes extraordinarily powerful.</p>]]></content><author><name>tsp</name></author><category term="Artificial Intelligence" /><category term="Tutorial" /><category term="How stuff works" /><category term="Machine learning" /><category term="LLM" /><summary type="html"><![CDATA[Large language models are often described as systems that store knowledge, but this picture is misleading. In reality, modern AI models do not function like databases filled with facts. Instead they learn complex patterns that describe how ideas, explanations, and symbols tend to relate to each other. When an LLM answers a question, it is not retrieving a stored entry, it is generating the most plausible continuation of a pattern learned from enormous amounts of text. This article explains how those patterns emerge inside neural networks, why LLMs sometimes produce convincing but incorrect answers, and why systems such as RAG and knowledge graphs are essential for reliable AI applications. 
By understanding how these models actually work, we can stop treating them like encyclopedias and start using them as what they really are: powerful reasoning engines operating on external knowledge systems.]]></summary></entry><entry><title type="html">Using MySQL as a Tool for n8n Agents - Flexible Queries without SQL Injection</title><link href="https://www.tspi.at/2026/03/07/n8nmysql.html" rel="alternate" type="text/html" title="Using MySQL as a Tool for n8n Agents - Flexible Queries without SQL Injection" /><published>2026-03-07T00:00:00+01:00</published><updated>2026-03-07T09:34:35+01:00</updated><id>https://www.tspi.at/2026/03/07/n8nmysql</id><content type="html" xml:base="https://www.tspi.at/2026/03/07/n8nmysql.html"><![CDATA[<p>When I started using <a href="https://n8n.io/">n8n</a> in combination with the <a href="https://www.mysql.com/">MySQL</a> node
I somewhat struggled with the documentation. I wished there was a single clear simple <em>recipe</em> describing
how to give an <code class="language-plaintext highlighter-rouge">n8n</code> AI agent access to a MySQL database by providing a list of parameters, being able
to write some filter code and specify the SQL statement yourself.</p>

<p>Most examples either:</p>

<ul>
  <li>Come in video format and are thus not very accessible.</li>
  <li>Use the existing <code class="language-plaintext highlighter-rouge">SELECT</code>, <code class="language-plaintext highlighter-rouge">INSERT</code> and <code class="language-plaintext highlighter-rouge">UPDATE</code> methods, which are restricted with
respect to arbitrary table access, arbitrary ordering and similar operations.</li>
  <li>Allow completely free SQL (which is dangerous).</li>
  <li>Restrict things so much that the agent becomes useless.</li>
</ul>

<p>The pattern described here works reliably in practice and allows an agent to query any allowed table in
measurement databases flexibly while still maintaining proper boundaries.</p>

<ul>
  <li><a href="#basic-idea">Basic Idea</a></li>
  <li><a href="#create-a-minimal-database-user-least-privilege">Create a Minimal Database User (Least Privilege)</a></li>
  <li><a href="#write-a-strict-tool-description">Write a Strict Tool Description</a></li>
  <li><a href="#defining-parameters-with-fromai">Defining Parameters</a></li>
  <li><a href="#building-the-sql-query">Building the SQL Query</a></li>
  <li><a href="#always-add-a-limit">Always Add a LIMIT</a></li>
  <li><a href="#security-rules-you-should-actually-follow">Security Rules You Should Actually Follow</a></li>
  <li><a href="#useful-extensions">Useful Extensions</a></li>
  <li><a href="#conclusion">Conclusion</a></li>
</ul>

<p><img src="/assets/images/png/n8nmysql001.png" alt="" /></p>

<h2 id="basic-idea">Basic Idea</h2>

<p>The concept is simple:</p>

<ul>
  <li>Attach a <strong>MySQL Tool Node</strong> to an agent node.</li>
  <li>Configure the connection statically (specify server, username, password and database)</li>
  <li>Give the tool a <strong>detailed description</strong> so the model understands which tables exist, ideally
informing it about the meaning of the columns.</li>
  <li>Build the SQL query dynamically in <strong>Execute SQL mode</strong> using JavaScript expressions. Apply whitelists
to restricted identifiers such as table names and validate all other parameters. No matter what you try, there
is no safe method other than whitelisting.</li>
</ul>

<p>Never allow the LLM to generate raw SQL if you care about the integrity of your data (though it is interesting to
see how an LLM handles its own database when it gets arbitrary access; just make sure to sandbox the
environment properly).</p>

<h2 id="create-a-minimal-database-user-least-privilege">Create a Minimal Database User (Least Privilege)</h2>

<p>If you don’t yet have a dedicated database user for <code class="language-plaintext highlighter-rouge">n8n</code>, create one with minimal permissions. For example:</p>

<div class="language-sql highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="k">CREATE</span> <span class="k">USER</span> <span class="s1">'n8nuser'</span> <span class="n">IDENTIFIED</span> <span class="k">WITH</span> <span class="n">mysql_native_password</span> <span class="k">BY</span> <span class="s1">'REPLACE_ME'</span><span class="p">;</span>

<span class="k">GRANT</span> <span class="k">SELECT</span> <span class="k">ON</span> <span class="n">exampledb</span><span class="p">.</span><span class="o">*</span> <span class="k">TO</span> <span class="s1">'n8nuser'</span><span class="p">;</span>
<span class="k">GRANT</span> <span class="k">SELECT</span> <span class="k">ON</span> <span class="n">exampledb2</span><span class="p">.</span><span class="o">*</span> <span class="k">TO</span> <span class="s1">'n8nuser'</span><span class="p">;</span>
</code></pre></div></div>

<p>For most reporting or sensor databases <strong>SELECT is enough</strong>. Avoid granting statements with side effects
like <code class="language-plaintext highlighter-rouge">INSERT</code>, <code class="language-plaintext highlighter-rouge">UPDATE</code>, <code class="language-plaintext highlighter-rouge">DELETE</code>, etc. unless you really need them. This is the same principle as for any
web application - only grant the minimal required permissions.</p>

<h2 id="write-a-strict-tool-description">Write a Strict Tool Description</h2>

<p>You can let <code class="language-plaintext highlighter-rouge">n8n</code> auto-generate the tool description, but it is usually better to define it manually, especially
when access is not otherwise tightly restricted. A strict and explicit description helps the agent understand
the schema and also prevents it from hallucinating tables.</p>

<p>A short example could look like the following:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>Execute a query in the historical measurement database (temperatures and humidities as well as information about the present sensors)

* table has to be one of the following table names (excluding the column descriptions):
   * humiditysensors(id, label, description)
   * temperaturesensors(id, label, description)
   * humidityvalues(ts, sensorid, humidity) are the humidity values of the sensors with sensor id sensorid at unix time ts.
   * temperaturevalues(ts, sensorid, temp) is the temperature of the sensor with sensorid (foreign key to temperaturesensors) at time ts (unix timestamp)
</code></pre></div></div>

<p>This does two things:</p>

<ul>
  <li>It teaches the agent which tables exist.</li>
  <li>It provides the basis for strict whitelisting later.</li>
</ul>

<h2 id="defining-parameters-with-fromai">Defining Parameters with <code class="language-plaintext highlighter-rouge">$fromAI</code></h2>

<p>This appears in nearly all tutorials, but it was never stated <em>explicitly</em> enough for me to grasp it immediately: to <em>define</em> a
parameter, you access it with the <code class="language-plaintext highlighter-rouge">$fromAI</code> method using:</p>

<div class="language-js highlighter-rouge"><div class="highlight"><pre class="highlight"><code><span class="nf">$fromAI</span><span class="p">(</span><span class="dl">'</span><span class="s1">parametername</span><span class="dl">'</span><span class="p">,</span> <span class="dl">'</span><span class="s1">description</span><span class="dl">'</span><span class="p">,</span> <span class="dl">'</span><span class="s1">datatype</span><span class="dl">'</span><span class="p">,</span> <span class="dl">'</span><span class="s1">default</span><span class="dl">'</span><span class="p">)</span>
</code></pre></div></div>

<p>The parameter is created the moment the expression is evaluated. Note that each parameter can only be defined <em>once</em>. This
means one cannot use <code class="language-plaintext highlighter-rouge">$fromAI</code> multiple times to access the same parameter. Instead one has to read it once, store it in a
variable and use that variable afterwards. Trying to reuse <code class="language-plaintext highlighter-rouge">$fromAI</code> directly in multiple places causes the node to fail.</p>
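<p>The pattern looks roughly like this. The <code class="language-plaintext highlighter-rouge">$fromAI</code> function below is a stand-in stub so the snippet is self-contained outside of n8n - inside an actual n8n expression you would of course use the real <code class="language-plaintext highlighter-rouge">$fromAI</code>:</p>

```javascript
// Stand-in for n8n's $fromAI so this sketch runs outside n8n. In n8n the
// first (and only) evaluation both DEFINES the parameter and returns the
// value the agent supplied; defining the same name twice fails the node.
const defined = new Set();
function $fromAI(name, description, type, fallback) {
  if (defined.has(name)) throw new Error(`Parameter ${name} defined twice`);
  defined.add(name);
  return fallback; // n8n would return the agent-provided value here
}

// Correct pattern: read the parameter ONCE, then reuse the variable.
const table = $fromAI('table', 'The table name', 'string', 'temperaturevalues');
const query = `SELECT * FROM ${table}`;    // reuse the variable...
const note  = `Agent asked for ${table}`;  // ...as often as you like
```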

<h2 id="building-the-sql-query">Building the SQL Query</h2>

<p>In the MySQL node you can enable <strong>Execute SQL</strong> and generate the query dynamically with JavaScript. This is powerful,
but it is of course also prone to SQL injection. The safe pattern is:</p>

<ul>
  <li>The agent provides only a <strong>key</strong> (for example <code class="language-plaintext highlighter-rouge">temperaturevalues</code>)</li>
  <li>The key is mapped to a <strong>hardcoded SQL identifier</strong></li>
  <li>Data is always safely escaped before being inserted into the statement (you can use the query parameter
setting and placeholders). An even better approach is to use prepared statements with bound parameters
(which is out of scope for this short article).</li>
</ul>

<p>The following shows an example implementation that allows the agent to access the tables mentioned above in an
arbitrary fashion. In addition it will order the results from tables that include timestamps by timestamp in
descending order:</p>

<div class="language-sql highlighter-rouge"><div class="highlight"><pre class="highlight"><code>
<span class="k">SELECT</span> 
<span class="p">{{</span>
  <span class="p">(()</span> <span class="o">=&gt;</span> <span class="p">{</span>
    <span class="o">//</span> <span class="k">Read</span> <span class="n">AI</span> <span class="k">parameters</span> <span class="k">only</span> <span class="n">once</span>
    <span class="n">const</span> <span class="n">t</span> <span class="o">=</span> <span class="n">String</span><span class="p">(</span><span class="err">$</span><span class="n">fromAI</span><span class="p">(</span><span class="s1">'table'</span><span class="p">,</span> <span class="s1">'The table name'</span><span class="p">,</span> <span class="s1">'string'</span><span class="p">,</span> <span class="s1">''</span><span class="p">)</span> <span class="o">||</span> <span class="s1">''</span><span class="p">)</span>
      <span class="p">.</span><span class="k">trim</span><span class="p">()</span>
      <span class="p">.</span><span class="n">toLowerCase</span><span class="p">();</span>

    <span class="o">//</span> <span class="n">Whitelist</span> <span class="n">mapping</span>
    <span class="n">const</span> <span class="n">allowed</span> <span class="o">=</span> <span class="p">{</span>
      <span class="n">humiditysensors</span><span class="p">:</span> <span class="s1">'`humiditysensors`'</span><span class="p">,</span>
      <span class="n">humidityvalues</span><span class="p">:</span> <span class="s1">'`humidityvalues`'</span><span class="p">,</span>
      <span class="n">temperaturesensors</span><span class="p">:</span> <span class="s1">'`temperaturesensors`'</span><span class="p">,</span>
      <span class="n">temperaturevalues</span><span class="p">:</span> <span class="s1">'`temperaturevalues`'</span><span class="p">,</span>
    <span class="p">};</span>

    <span class="n">const</span> <span class="n">tablesWithTs</span> <span class="o">=</span> <span class="k">new</span> <span class="k">Set</span><span class="p">([</span><span class="s1">'temperaturevalues'</span><span class="p">,</span> <span class="s1">'humidityvalues'</span><span class="p">]);</span>

    <span class="n">if</span> <span class="p">(</span><span class="o">!</span><span class="n">allowed</span><span class="p">[</span><span class="n">t</span><span class="p">])</span> <span class="p">{</span>
      <span class="n">throw</span> <span class="k">new</span> <span class="n">Error</span><span class="p">(</span><span class="nv">`Disallowed or unknown table key: ${t}`</span><span class="p">);</span>
    <span class="p">}</span>

    <span class="n">const</span> <span class="n">orderClause</span> <span class="o">=</span>
      <span class="n">tablesWithTs</span><span class="p">.</span><span class="n">has</span><span class="p">(</span><span class="n">t</span><span class="p">)</span>
        <span class="o">?</span> <span class="s1">' ORDER BY `ts` DESC'</span>
        <span class="p">:</span> <span class="s1">''</span><span class="p">;</span>

    <span class="n">const</span> <span class="n">selector</span> <span class="o">=</span>
      <span class="n">tablesWithTs</span><span class="p">.</span><span class="n">has</span><span class="p">(</span><span class="n">t</span><span class="p">)</span>
        <span class="o">?</span> <span class="s1">'*, FROM_UNIXTIME(ts) AS ts_readable'</span>
        <span class="p">:</span> <span class="s1">'*'</span><span class="p">;</span>

    <span class="k">return</span> <span class="nv">`${selector} FROM ${allowed[t]}${orderClause}`</span><span class="p">;</span>
  <span class="p">})()</span>
<span class="p">}}</span>
<span class="k">LIMIT</span> <span class="mi">50</span><span class="p">;</span>

</code></pre></div></div>

<p>What this does:</p>

<ul>
  <li>The agent provides the table <strong>only as a logical key</strong>. It never gets passed into the
statement directly.</li>
  <li>The key must exist in the whitelist (you can of course also build the whitelist in n8n by first
executing a <code class="language-plaintext highlighter-rouge">SHOW TABLES</code> statement and passing along its result).</li>
  <li>Time series tables are automatically ordered by <code class="language-plaintext highlighter-rouge">ts DESC</code>.</li>
  <li>A readable timestamp (<code class="language-plaintext highlighter-rouge">ts_readable</code>) is generated using <code class="language-plaintext highlighter-rouge">FROM_UNIXTIME()</code>.</li>
</ul>

<p>This approach prevents the agent from injecting arbitrary SQL fragments.</p>

<h2 id="always-add-a-limit">Always Add a <code class="language-plaintext highlighter-rouge">LIMIT</code></h2>

<p>Never allow unlimited queries. If you do, you will regret it. Also do not make the limit configurable in
an unbounded fashion - not even when it seems to work at first. If you forget the <code class="language-plaintext highlighter-rouge">LIMIT</code>,
sooner or later the agent will try to read an entire table, which typically leads to:</p>

<ul>
  <li>MySQL running <em>very</em> long queries</li>
  <li><code class="language-plaintext highlighter-rouge">n8n</code> workers blocking</li>
  <li>the agent producing unusable responses due to context thrashing or <a href="https://research.trychroma.com/context-rot">context rot</a>.</li>
  <li>very large context windows with an enormous number of tokens, which you will spot on your bill.</li>
</ul>

<p>A safe default is something like:</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>LIMIT 50
</code></pre></div></div>

<p>For more advanced setups you can add pagination. Also do not forget to limit the maximum number of
iterations your agent can run in a loop. Your financial account will thank you.</p>
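<p>If you do make the limit configurable, clamp it to a hard upper bound. A minimal sketch (<code class="language-plaintext highlighter-rouge">safeLimit</code> is a hypothetical helper, not part of n8n):</p>

```javascript
// Clamp an agent-supplied limit to a safe range. Non-numeric or
// non-positive input falls back to a conservative default.
function safeLimit(raw, fallback = 50, max = 200) {
  const n = Number.parseInt(String(raw ?? ''), 10);
  if (!Number.isInteger(n) || n <= 0) return fallback;
  return Math.min(n, max);
}
```

<p>The resulting number can then be interpolated into the <code class="language-plaintext highlighter-rouge">LIMIT</code> clause, since it is guaranteed to be a bounded integer.</p>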

<h2 id="security-rules-you-should-actually-follow">Security Rules You Should Actually Follow</h2>

<p>Treat <em>LLM input exactly like user input</em>. This means:</p>

<ul>
  <li>Never insert raw LLM strings into SQL.</li>
  <li>Only allow whitelisted identifiers.</li>
  <li>Fail the request when validation fails.</li>
  <li>Use a least-privilege database user.</li>
</ul>

<p>LLMs are not malicious but they are <em>very</em> creative. And creativity plus databases without
guardrails tends to produce unpleasant surprises.</p>

<h2 id="useful-extensions">Useful Extensions</h2>

<p>Once the basic version works, you can extend the system safely. Typical improvements include:</p>

<ul>
  <li><strong>Filtering</strong>: In the above example, allow a numeric <code class="language-plaintext highlighter-rouge">sensorid</code> parameter and parse it strictly as an integer.</li>
  <li><strong>Time window queries</strong>: Allow queries such as <code class="language-plaintext highlighter-rouge">WHERE ts BETWEEN ... AND ...</code> but enforce maximum time spans.</li>
  <li><strong>Aggregation</strong>: Support queries like <code class="language-plaintext highlighter-rouge">AVG(temp) GROUP BY hour</code> again through explicit whitelists.</li>
</ul>
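<p>The first two extensions can be sketched as strict parsing plus a clamped time window. The seven-day ceiling and helper names here are illustrative assumptions:</p>

```python
from datetime import datetime, timedelta

MAX_SPAN = timedelta(days=7)  # example ceiling, not from the article

def parse_sensorid(raw) -> int:
    """Accept only a plain (possibly negative) integer; reject anything else."""
    if not str(raw).strip().lstrip("-").isdigit():
        raise ValueError(f"not an integer: {raw!r}")
    return int(raw)

def clamp_window(start: datetime, end: datetime) -> tuple[datetime, datetime]:
    """Enforce a maximum span for WHERE ts BETWEEN ... AND ... queries."""
    if end <= start:
        raise ValueError("end must be after start")
    if end - start > MAX_SPAN:
        end = start + MAX_SPAN  # shrink the window instead of failing
    return start, end
```
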

<p>This allows scaling the system without turning the SQL builder into a huge block of logic.</p>

<h2 id="conclusion">Conclusion</h2>

<p>With a MySQL tool node, strict tool descriptions, <code class="language-plaintext highlighter-rouge">$fromAI</code> parameters and a whitelist-based SQL builder you can
create a flexible agent-driven database interface without exposing your database to SQL injection. Just keep in mind:</p>

<p><strong>Never trust LLM input, treat it like user input.</strong></p>]]></content><author><name>tsp</name></author><category term="Programming" /><category term="Basics" /><category term="Artificial Intelligence" /><category term="Tutorial" /><category term="Web" /><category term="Large Language Models" /><category term="n8n" /><summary type="html"><![CDATA[Using large language models together with automation platforms like n8n often requires giving the agent controlled access to structured data. Simply allowing an LLM to generate arbitrary SQL queries is dangerous, but overly restrictive configurations quickly make database tools useless. This article presents a practical pattern that allows flexible database queries while still enforcing strict safety boundaries. The approach combines n8ns MySQL tool node, fromAI parameters and a whitelist based SQL builder to prevent SQL injection while still allowing agents to explore a measurement database intelligently. With a few simple rules - least-privilege database users, identifier whitelists and mandatory query limits - one can safely expose structured data to an AI agent without risking ones database.]]></summary></entry><entry><title type="html">The Many Faces of Coherence in Physics (and Beyond)</title><link href="https://www.tspi.at/2026/01/19/coherence.html" rel="alternate" type="text/html" title="The Many Faces of Coherence in Physics (and Beyond)" /><published>2026-01-19T00:00:00+01:00</published><updated>2026-02-15T17:29:21+01:00</updated><id>https://www.tspi.at/2026/01/19/coherence</id><content type="html" xml:base="https://www.tspi.at/2026/01/19/coherence.html"><![CDATA[<div style="text-align: center; width: 100%;">
    <iframe width="560" height="315" src="https://www.youtube.com/embed/7bUIEMtzJC0?si=tVQBrATEuBtj7WzH" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen=""></iframe>
</div>

<p>The term coherence has multiple meanings across physics and philosophy, all centered on an underlying idea of parts <em>sticking together</em> or <em>acting in unison</em>. In everyday language and philosophy, coherence usually means <em>logical consistency and intelligibility</em> - a coherent argument is one whose parts fit together without contradiction. In physics, coherence more specifically describes <em>correlated behavior of waves</em> or <em>quantum states</em>. For example, coherent waves are in phase with each other (maintaining a fixed phase relationship), and quantum coherence refers to the definite phase relationships in a <em>quantum superposition</em>. Despite the varied contexts, these meanings share the notion of a unified, orderly relationship among components (whether phases of waves, quantum amplitudes, or propositions in an argument). In the sections below, I survey the diverse meanings of <em>“coherence”</em>, focusing primarily on physics (classical waves and quantum mechanics) and touching on philosophical usage.</p>

<ul>
  <li><a href="#historical-evolution-of-the-concept">Historical Evolution of the Concept</a></li>
  <li><a href="#coherence-in-classical-wave-physics">Coherence in Classical Wave Physics</a></li>
  <li><a href="#quantum-coherence-and-superposition">Quantum Coherence and Superposition</a>
    <ul>
      <li><a href="#coherent-states-in-quantum-mechanics">Coherent States in Quantum Mechanics</a></li>
      <li><a href="#quantum-decoherence">Quantum Decoherence</a></li>
      <li><a href="#coherent-control-and-manipulation-in-quantum-systems">Coherent Control and Manipulation in Quantum Systems</a></li>
    </ul>
  </li>
  <li><a href="#coherence-in-philosophy-and-logic">Coherence in Philosophy and Logic</a></li>
  <li><a href="#conclusion">Conclusion</a></li>
</ul>

<p><img src="/assets/images/png/coherence001.png" alt="" /></p>

<h2 id="historical-evolution-of-the-concept">Historical Evolution of the Concept</h2>

<p>The concept of coherence in physics emerged from 19th century studies of wave interference. Thomas Young’s famous <em>double-slit experiment</em> in 1801 implicitly required coherent light - using a single light source split into two paths - to produce stable interference fringes. Young and other physicists at that time recognized that two independent light sources - for example the sun and a lamp - generally do not produce visible interference because they lack fixed phase relations. In 1819, Fresnel and Arago formulated laws of interference, effectively noting conditions under which light waves cohere (e.g. same frequency and polarization) to produce fringes. By the late 19th century, techniques like Michelson’s interferometry further quantified coherence: observers noticed that white light interference disappeared beyond a certain path difference, hinting at a finite coherence length.</p>

<p>In the early 20th century, coherence was formalized in statistical optics. Pioneering work by Fritz Zernike in 1938 introduced the degree of coherence as a quantitative measure, defined via the fringe visibility between two points in a wavefield. Zernike’s work and the van Cittert–Zernike theorem showed how a source’s size and spectral bandwidth determine the partial coherence of light. The invention of the laser in 1960 provided a source of nearly fully coherent, highly monochromatic and phase-stable light, which revolutionized optics and validated these theories. Laser light can have coherence lengths of kilometers, whereas sunlight’s coherence length is only a few microns.</p>

<p>The quantum era brought new facets to coherence. In 1963, Roy Glauber developed the quantum theory of optical coherence, introducing coherent states of the electromagnetic field and correlation functions to describe photon statistics. Glauber’s work - awarded the Nobel Prize in 2005 - established how classical coherence concepts extend to quantum light. Meanwhile, physicists like Erich Joos, Dieter Zeh and Wojciech Zurek studied quantum decoherence from the 1970s to the 1990s - how interactions with the environment destroy coherence and make quantum systems appear classical. By the 21st century, quantum coherence became recognized as a <em>resource</em> for technologies like quantum computing, requiring careful preservation and manipulation.</p>

<p>In <em>philosophy</em>, on the other hand, the notion of coherence has an older pedigree in theories of truth and knowledge. <em>Coherence</em> as a criterion of truth was advanced by 19th-20th century idealist philosophers (e.g. Hegel, Bradley) and later formalized as the coherence theory of truth. According to this view, a proposition is true if it coheres with (i.e. is consistent with, or entailed by) a set of other accepted propositions. Early versions simply equated coherence with logical consistency, though more refined versions involve mutual explanatory support. Thus, the idea of <em>“coherence”</em> as internal consistency in a system of ideas has a long history in epistemology alongside its development in physics.</p>

<h2 id="coherence-in-classical-wave-physics">Coherence in Classical Wave Physics</h2>

<p>In classical optics and wave physics, coherence describes the ability of waves to exhibit stable interference due to fixed phase relationships. Optical coherence specifically refers to the capacity of a light wave (or two waves) to produce an interference pattern of alternating constructive and destructive fringes. If two light beams show no interference (no stable bright/dark pattern), they are said to be incoherent with each other; if they produce clear, high-contrast fringes (including complete destructive cancellation at some points), they are fully coherent. Intermediate cases (partial fringe visibility) indicate partial coherence.</p>

<p>In physics, one distinguishes two aspects of coherence for waves:</p>

<ul>
  <li><strong>Temporal Coherence</strong>: Correlation of a wave with itself at different times. This relates to the wave’s monochromaticity (single-frequency purity). A perfectly monochromatic (single-frequency) wave has infinite temporal coherence - its phase is predictable for any time shift, yielding interference even between widely separated time samples. In practice, real sources have finite <em>bandwidth</em>, so the phase drifts over time. The coherence time $\tau_c$ is defined as the maximum time interval over which the wave’s phase is predictable or correlated with itself. Equivalently, it is the time over which the field <em>“looks”</em> approximately sinusoidal with a stable phase. Beyond $\tau_c$, phase relationships effectively randomize and interference visibility drops to zero. The coherence length $L_c = c \tau_c$ is the propagation distance corresponding to the coherence time. For example, a stabilized laser with extremely narrow linewidth might have $\tau_c \sim 10^{-4}$ s and $L_c$ on the order of $30$ km, whereas broadband sunlight has $\tau_c$ on the order of $10^{-12}$ s and $L_c$ of a few micrometers. Temporal coherence is what a Michelson interferometer measures by varying path delay: fringe contrast diminishes as the delay exceeds the source’s coherence time. In summary, temporal coherence quantifies how well a wave maintains a stable phase over time - the narrower the spectrum (smaller bandwidth), the longer the coherence time.</li>
  <li><strong>Spatial Coherence</strong>: Correlation of a wave’s phase at different points in space across an extended wavefront or between separate beams. This is related to the spatial uniformity of phase across the wave. Spatial coherence determines whether two points on a wavefront can form interference fringes when combined. A wave is spatially coherent across a region if any two points within that region emit waves with a fixed phase relation. Young’s double-slit experiment is a test of spatial coherence: if the two pinholes (separated by some distance $d$) are illuminated by a source, clear interference fringes appear on a screen only if $d$ is within the source’s transverse coherence length. If the pinholes are too far apart, each sees a different phase of the source at a given time, and the fringe visibility degrades to nothing. The coherence area is defined as the area over which the light field is spatially coherent (e.g. for filtered sunlight it might be ~$4\times10^{-3}$ mm$^2$, meaning pinholes must lie within a ~0.06 mm distance to observe interference). In general, smaller or more distant sources (like a distant star approximating a point source) have higher spatial coherence than extended sources (like the sun’s disk). The van Cittert–Zernike theorem provides a quantitative link: the spatial coherence function across a plane is essentially the Fourier transform of the source’s intensity distribution - a large angular source yields low spatial coherence, and a point-like source yields high coherence.</li>
</ul>
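<p>The bandwidth-coherence relationship above can be sketched numerically. Note that $\tau_c \approx 1/\Delta\nu$ is only an order-of-magnitude rule - the exact prefactor depends on the spectral line shape - and the example bandwidths are illustrative round numbers:</p>

```python
# Coherence time from spectral bandwidth via tau_c ~ 1/dnu, and
# coherence length L_c = c * tau_c (order-of-magnitude estimates only).
C = 3.0e8  # speed of light in m/s

def coherence_length(bandwidth_hz: float) -> float:
    """Return the coherence length in meters for a given bandwidth in Hz."""
    tau_c = 1.0 / bandwidth_hz   # coherence time in seconds
    return C * tau_c

# Broadband sunlight: dnu ~ 3e14 Hz -> L_c ~ 1e-6 m (about a micrometer)
# Narrow-line laser:  dnu ~ 1e4 Hz  -> L_c ~ 3e4 m (tens of kilometers)
```
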

<p>In practice most light fields are neither fully coherent nor fully incoherent, but somewhere in between. The <strong>degree of coherence</strong> (first-order coherence) can be quantified by a complex coherence function or correlation coefficient $\gamma_{12}$ between two points or times. This is essentially the normalized cross-correlation of the wave’s electric field at the two points/times. If $\mid\gamma_{12}\mid = 1$, the fields are perfectly coherent (phase of one completely predicts the phase of the other) and interference contrast is maximal. If $\mid\gamma_{12}\mid = 0$, they are completely uncorrelated (incoherent) and no clear interference appears. Partially coherent light yields an intermediate $0&lt;\mid\gamma\mid&lt;1$, producing fringes of reduced contrast (washed-out interference). In summary, coherence in classical waves refers to the presence of stable correlations (in phase and amplitude) either over time or across space, enabling observable interference effects. Techniques like holography, interferometry, and optical coherence tomography all rely on manipulating and measuring these coherence properties of light.</p>
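<p>The normalized correlation $\gamma_{12}$ can be estimated directly from sampled fields. A sketch with synthetic signals (the sampling grid and frequencies are arbitrary choices for illustration):</p>

```python
import numpy as np

def degree_of_coherence(e1: np.ndarray, e2: np.ndarray) -> complex:
    """Normalized first-order correlation gamma_12 of two sampled fields."""
    num = np.mean(np.conj(e1) * e2)
    den = np.sqrt(np.mean(np.abs(e1) ** 2) * np.mean(np.abs(e2) ** 2))
    return num / den

t = np.linspace(0.0, 1.0, 2000)
field = np.exp(1j * 2 * np.pi * 50 * t)                   # monochromatic wave
shifted = field * np.exp(1j * 0.7)                        # fixed phase offset
noise = np.exp(1j * 2 * np.pi * np.random.rand(t.size))   # random phases

# A fixed phase relation gives |gamma| = 1 (full fringe contrast);
# uncorrelated random phases give |gamma| close to 0 (no fringes).
```
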

<p>In other wave phenomena beyond optics, coherence has analogous meanings. For example, in acoustics two sound waves are coherent if they have a constant phase difference; in radio communications, a coherent receiver maintains a reference phase to interfere the incoming signal. Coherence is a unifying wave concept signifying the presence of an underlying order or correlation in the wavefield.</p>

<h2 id="quantum-coherence-and-superposition">Quantum Coherence and Superposition</h2>

<p>In quantum physics, coherence refers to the existence of definite phase relationships between quantum states. A quantum state that is a superposition of two or more basis states is coherent if the relative phases are well-defined and stable, allowing for interference effects at the quantum level. In contrast, if those phase relationships are randomized or unknown (as in a statistical mixture), the superposition is incoherent. Quantum coherence is what differentiates a pure quantum superposition from a mere classical probabilistic mixture.</p>

<p>For example, an electron in a superposition $\frac{1}{\sqrt{2}}(\mid\uparrow&gt; + e^{i\phi}\mid\downarrow&gt;)$ has coherence between the spin-up and spin-down components - the phase $e^{i\phi}$ will lead to interference effects in experiments. If that phase is completely random or the state is an incoherent mixture $\tfrac{1}{2}(\mid\uparrow&gt;\langle\uparrow\mid + \mid\downarrow&gt;\langle\downarrow\mid)$, no single-particle interference can occur between the $\mid\uparrow&gt;$ and $\mid\downarrow&gt;$ outcomes. Thus, quantum coherence is essential for phenomena like single-particle interference (an electron or photon interfering with itself in a double-slit experiment) and is a prerequisite for <em>entangled correlations</em> between particles.</p>

<p>Formally, quantum coherence can be defined in terms of the density matrix of a system. In a chosen reference basis, the off-diagonal elements of the density matrix measure the coherence between the corresponding basis states. An incoherent state is one whose density matrix is diagonal in the reference basis (no superposition terms). Any state with nonzero off-diagonal entries is a coherent superposition in that basis. It possesses quantum coherence as a resource. For example, if we take the computational basis ${\mid 0&gt;,\mid 1&gt;}$ for a qubit, an incoherent state would be of the form $\rho = p\mid 0&gt;\langle 0\mid + (1-p)\mid 1&gt;\langle 1\mid$ (diagonal), whereas a state like $\mid\psi&gt; = \alpha\mid 0&gt; + \beta\mid 1&gt;$ has off-diagonal terms $\alpha\beta^*$ in its density matrix and hence is coherent. The magnitude of those off-diagonals relates to the visibility of interference one could observe between the states $\mid 0&gt;$ and $\mid 1&gt;$.</p>
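<p>The density-matrix picture can be made concrete with a few lines of NumPy. The $l_1$-norm used here is one common coherence monotone (the sum of the magnitudes of the off-diagonal entries):</p>

```python
import numpy as np

def l1_coherence(rho: np.ndarray) -> float:
    """Sum of |off-diagonal| entries of a density matrix."""
    return float(np.sum(np.abs(rho)) - np.trace(np.abs(rho)).real)

# Equal superposition (|0> + |1>)/sqrt(2): off-diagonals are 0.5 each.
psi = np.array([1.0, 1.0]) / np.sqrt(2.0)
rho_pure = np.outer(psi, psi.conj())

# 50/50 statistical mixture of |0> and |1>: diagonal, no coherence.
rho_mixed = np.diag([0.5, 0.5])
```

<p>The pure superposition carries one unit of $l_1$-coherence while the mixture carries none, even though both give 50/50 measurement statistics in the computational basis.</p>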

<p>In modern quantum information science, coherence is treated as a <strong>quantifiable resource</strong> (much like entanglement). There are measures, coherence monotones, that assign a number to how much coherence a given state has relative to a specified basis. Intuitively, this corresponds to how well the state can produce interference or be used in quantum algorithms. Coherence is <em>“consumed”</em> or <em>degraded</em> by interactions that cause decoherence and it can be partially converted into other quantum resources like entanglement under the right operations. Quantum computing relies on maintaining coherence in qubits throughout computational gate operations. The superposition of $\mid 0&gt;$ and $\mid 1&gt;$ in each qubit (and across multiple qubits) must remain coherent long enough to perform interference-based algorithms. If qubits lose coherence too quickly, quantum computation reverts to classical outcomes.</p>

<p>At a fundamental level, quantum coherence underlies phenomena like quantum interference (e.g. electron diffraction patterns require the electron wavefunction to remain coherent across the paths) and is linked to entanglement. Entanglement can be viewed as a kind of <em>coherence between subsystems</em>: an entangled pair of particles has no local coherence (each reduced state may be mixed) but has joint coherence in the global state. In summary, quantum coherence captures the <em>wavelike</em> aspect of quantum states - the ability of probability amplitudes to superpose and interfere.</p>

<h3 id="coherent-states-in-quantum-mechanics">Coherent States in Quantum Mechanics</h3>

<p>In quantum mechanics, the phrase <em>coherent state</em> has a specific technical meaning beyond just <em>“state with coherence”</em>. Coherent states typically refer to a special set of quantum states of a harmonic oscillator or field that most closely resemble classical oscillations. The canonical example is the Glauber–Sudarshan coherent state of the electromagnetic field - the quantum state of light that a stabilized laser outputs.</p>

<p>Mathematically, a coherent state $\mid \alpha&gt;$ (for a harmonic oscillator or single mode of the field) is defined as the eigenstate of the annihilation (lowering) operator $\hat{a}$.
These $\mid \alpha&gt;$ states form an overcomplete, non-orthogonal basis of the oscillator’s Hilbert space. They are often written as displaced vacuum states and have the minimum uncertainty allowed by quantum mechanics (equal uncertainties in $x$ and $p$), which is why they are sometimes called <em>“minimum uncertainty wavepackets”</em>. A coherent state exhibits Poissonian number statistics and, in many respects, behaves like a classical sinusoidal oscillation with amplitude $\mid \alpha\mid$ and phase $\arg(\alpha)$. For instance, the electric field expectation oscillates in time as a classical field would, and the probability of finding $n$ photons in $\mid \alpha&gt;$ is $P(n)=e^{-\mid\alpha\mid^2}\mid\alpha\mid^{2n}/n!$ (a Poisson distribution).</p>

<p>$
\begin{aligned}
 \hat{a} \mid\alpha&gt; &amp;= \alpha \mid\alpha&gt;
\end{aligned}
$</p>
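<p>The Poissonian photon statistics quoted above are easy to verify numerically - the mean photon number of a coherent state equals $\mid\alpha\mid^2$:</p>

```python
import math

def coherent_photon_prob(alpha_sq: float, n: int) -> float:
    """Poisson probability of finding n photons in |alpha>, given |alpha|^2."""
    return math.exp(-alpha_sq) * alpha_sq ** n / math.factorial(n)

# Truncating the sum at n = 59 is safe here: for |alpha|^2 = 4 the
# remaining tail mass is utterly negligible.
mean_n = sum(n * coherent_photon_prob(4.0, n) for n in range(60))
```
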

<p>Historically, coherent states were studied by Schrödinger (1926) as Gaussian wavepackets that remain localized in a harmonic potential without spreading. In the 1960s, Glauber and Sudarshan formally introduced them in the context of quantum optics to describe the output of a laser and to define what <em>classical light</em> means quantum-mechanically. In optical coherence theory, these coherent states are considered the most classical states of the field - any state that is a statistical mixture of coherent states is regarded as a classical light field, whereas states that cannot be expressed as such a mixture (e.g. squeezed states, Fock states) are nonclassical. In fact, one can define optical nonclassicality as the presence of quantum coherence that cannot be accounted for by a random classical field. Coherent states thus straddle the line between quantum and classical: they are quantum states with maximal coherence (in the sense that they saturate certain coherence measures) but they produce dynamics and statistics reminiscent of classical waves.</p>

<p>Coherent states play a role in coherent state quantization and path integrals - they provide a convenient basis to represent quantum dynamics (especially in quantum optics and many-body theory). In summary, a coherent state in quantum mechanics is a specific type of quantum state (especially of oscillators/fields) characterized by classical-like behavior and defined by eigenstate relations like $\hat{a}\mid\alpha&gt; = \alpha\mid\alpha&gt;$. It should not be confused with the broader notion of <em>“a state having coherence”</em>; rather, it is a term for these minimum-uncertainty wavepackets that remain as coherent as possible.</p>

<h3 id="quantum-decoherence">Quantum Decoherence</h3>

<p>Quantum decoherence is the process by which quantum coherence is lost - or apparently lost - due to a system’s interaction with its environment. When a quantum system is not perfectly isolated, the phase information that defines its coherent superposition can become entangled with environmental degrees of freedom. From the perspective of the system alone, the coherent superposition then appears to collapse into a mixture, as the relative phases are no longer observable (having <em>“leaked”</em> into the environment). Decoherence is the mechanism that explains how classical behavior emerges from quantum systems: it suppresses interference between a system’s quantum states by irreversibly correlating those states with different states of the environment.</p>

<p><em>“Quantum decoherence is the loss of quantum coherence”</em>, typically through <strong>loss of information from the system to the environment</strong>. During decoherence, no fundamental wavefunction collapse is assumed to occur; rather, the system-plus-environment evolves unitarily, but the system’s reduced state transitions from pure to mixed. The off-diagonal elements of the system’s density matrix decay towards zero as coherence is delocalized into the environment. A well-known analogy is <em>friction</em>: just as mechanical energy dissipates into environmental heat, quantum phase information dissipates into environmental degrees of freedom. The coherence is <em>not destroyed</em> per se but becomes <em>inaccessible</em> - stored as correlations between the system and environment that are practically irretrievable.</p>

<p><strong>Decoherence theory</strong>, developed by Zeh, Zurek and others, provides a resolution to the quantum measurement paradox in that it explains the apparent collapse of the wavefunction. For example, a state describing Schrödinger’s cat $\frac{1}{\sqrt{2}}(\mid\text{alive}&gt; + \mid\text{dead}&gt;)$ rapidly decoheres due to environmental interactions (air molecules, photons, etc.) that <em>measure</em> the cat’s state. The environment <em>gains information</em> about which branch (alive or dead) occurred, and interference between the branches becomes unobservable - the cat is effectively in a classical probabilistic mixture from any local standpoint. Decoherence does not produce a true wavefunction collapse on its own, but it makes interference between macroscopically distinct states vanish extraordinarily quickly, yielding the appearance of a definite outcome.</p>

<p>Quantitatively, decoherence can be characterized - analogously to the coherence time - by a <em>decoherence time</em> over which the off-diagonals decay. This can be exceedingly short for macroscopic differences - e.g. a dust particle can decohere in $10^{-20}$ seconds when hit by air molecules. Decoherence ties into the concept of <em>pointer states</em>: the basis in which the system remains robust (diagonal) under environmental interaction. Those pointer states (like the <em>alive</em> or <em>dead</em> cat) are the classical-like states that do not themselves get blurred by interference <strong>because the environment continually monitors them</strong>.</p>
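<p>A toy pure-dephasing model illustrates the decay of the off-diagonals. The exponential form assumes a simple Markovian environment; $T_2$ here is an arbitrary illustrative timescale:</p>

```python
import numpy as np

def decohered_rho(rho0: np.ndarray, t: float, t2: float) -> np.ndarray:
    """Pure dephasing: off-diagonals decay as exp(-t/T2), diagonals survive."""
    decay = np.exp(-t / t2)
    rho = rho0.copy()
    rho[0, 1] *= decay
    rho[1, 0] *= decay
    return rho

rho0 = np.array([[0.5, 0.5], [0.5, 0.5]])   # coherent superposition
late = decohered_rho(rho0, t=10.0, t2=1.0)  # ten decoherence times later
# late is almost diagonal: only the classical mixture diag(0.5, 0.5) remains.
```
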

<p>For emerging quantum technologies, decoherence is a critical enemy. Qubits in a quantum computer must be isolated from noise sources because any unmonitored interaction (thermal fluctuations, stray fields, etc.) can entangle with the qubit and collapse superpositions, thereby ruining quantum computations. Quantum error-correcting codes and decoherence-free subspaces are being developed to counteract this.</p>

<blockquote>
  <p>Decoherence is the process that destroys coherence: a coherent quantum superposition evolves into an incoherent mixture when the system’s phase information leaks into the environment. It bridges quantum and classical physics by explaining why large, open systems don’t usually display quantum interference, and it highlights why maintaining coherence (isolation or error correction) is essential in quantum experimentation and technology.</p>
</blockquote>

<h3 id="coherent-control-and-manipulation-in-quantum-systems">Coherent Control and Manipulation in Quantum Systems</h3>

<p>The term coherent is also used in the context of <em>controlling</em> physical systems, especially in quantum mechanics, to imply control that <em>preserves phase relations</em>. <strong>Coherent manipulation</strong> (or <strong>coherent control</strong>) refers to using interactions (like laser pulses, microwave fields, etc.) to steer a quantum system’s state in a deterministic, phase-preserving way. The system’s evolution is unitary and maintains quantum coherence throughout the process, as opposed to <em>incoherent</em> processes (like measurements or random thermal kicks) that induce decoherence.</p>

<p>For example, <a href="/2026/01/11/nmrexp01.html">one can apply a sequence of precisely timed magnetic resonance pulses to a spin qubit system</a> to rotate the spin state on the <a href="/2024/02/16/electricdipole01.html">Bloch sphere</a>. If these operations are done <em>faster</em> than the <em>decoherence time</em> and with precise <em>phase reference</em>, the spin is undergoing coherent manipulation. The ability to perform such operations is a prerequisite to any scalable quantum information platform - you must be able to apply coherent control to qubits in order to do quantum gates without losing the information to decoherence. Coherent manipulation typically implies that the system remains in a <em>pure state</em> during the operation.</p>

<p>In a broader sense, coherent control is a subfield of physics and chemistry where interference of quantum amplitudes is used to direct outcomes. For instance, in photochemistry, carefully shaped laser pulses have been used to coherently control <em>reaction pathways</em> by interfering different excitation pathways. The coherence here implies that the driving fields and the system’s response maintain a fixed phase relationship, producing constructive or destructive interference in the quantum transition amplitudes to favor a desired product.</p>

<p>A classic example is coherent population trapping or <em>STIRAP</em> (STImulated Raman Adiabatic Passage) in atomic physics, where two coherent lasers create interference that drives an atom from state $\mid A&gt;$ to $\mid B&gt;$ <em>without populating an intermediate state</em> $\mid C&gt;$. The success of this technique relies on maintaining phase coherence between the two laser fields and the atomic polarization. If the lasers were not phase-coherent with each other, the interference would average out and the coherent transfer would fail.</p>

<blockquote>
  <p>Describing an experimental technique as coherent (coherent manipulation, coherent spectroscopy, etc.) implies that phase coherence is preserved throughout the process. The system’s evolution is phase-correlated with the driving fields or between its own states. This is crucial in quantum computing (for executing logic gates on superpositions), in quantum optics (such as creating entangled photons via coherent pump processes), and in quantum sensing (where a phase-coherent superposition interacts with a field and accumulates a measurable phase shift). A large effort in modern physics is to extend the duration of coherent control (increase coherence times) by improving isolation, material purity, and using dynamical decoupling or error correction.</p>
</blockquote>

<h2 id="coherence-in-philosophy-and-logic">Coherence in Philosophy and Logic</h2>

<p>Outside of physics, coherence generally refers to a property of statements, beliefs, or arguments - essentially, a logical and orderly consistency. A coherent statement or theory is one whose parts are logically connected and free of contradictions, so that the whole <em>sticks together</em> conceptually. For example, we might say a scientific theory is coherent if its various hypotheses and observations support each other and form a unified explanation. In everyday terms, someone <em>“speaking coherently”</em> is expressing their thoughts clearly and consistently.</p>

<p>In epistemology and the theory of truth, coherence plays a prominent role through the coherence theory of truth and coherentism. The coherence theory of truth holds that the truth of a proposition consists in its coherence with a specified set of other propositions. In other words, a new claim is true if it fits harmoniously into a larger body of beliefs without contradiction and with mutual support. This is in contrast to the correspondence theory of truth, which says truth consists in correspondence to objective facts. Coherentist philosophers argue that an isolated proposition cannot be judged <em>true</em> or <em>false</em> except by seeing whether it coheres with an entire system of propositions believed to be true. Early versions of coherentism equated coherence with mere logical consistency, but more developed versions require a stronger entailment or explanatory relation among beliefs. For instance, one belief coheres with others not just by avoiding contradiction, but by perhaps being entailed by them or contributing to their overall support.</p>

<p>Coherentism in epistemology is the view that justification of beliefs lies in their coherence with all other beliefs one holds, rather than in some foundational <em>self-evident</em> truths. In this view, our knowledge is like a web where each strand supports the others, and the entire network is coherent if it contains no contradictions and is <em>mutually reinforcing</em>. A perfectly coherent belief system would be one where every belief is consistent with every other and perhaps each is derivable from the whole. While perfect coherence is an ideal, coherence theories allow that truth and justification come in degrees - a belief can be more or less coherent with the rest, and accordingly more or less justified or likely true.</p>

<p>To illustrate, consider a detective assembling a story of what happened at a crime scene. A coherent explanation is one where all pieces of evidence fit into a single consistent timeline and causation, with no inexplicable gaps or contradictions. If a new piece of evidence doesn’t contradict but instead is predicted by or entailed by the theory, it increases the coherence of the theory, and thus increases our confidence in its truth. If the evidence creates a contradiction, the theory becomes <em>incoherent</em> and must be revised or rejected.</p>

<blockquote>
  <p>In the realm of ideas, coherence means internal consistency and logical connectivity. A coherent argument has premises that support the conclusion in an organized way. A coherent policy plan is one where the measures align with each other and with the overall goals. The common thread is that nothing <em>sticks out</em> as inconsistent or unrelated - all parts contribute to a unified whole. This everyday/philosophical notion of coherence, while conceptually distinct from physical coherence, metaphorically resonates with the physics usage: just as coherent waves march in step to produce a clear signal, coherent thoughts align together to produce clear understanding.</p>
</blockquote>

<h2 id="conclusion">Conclusion</h2>

<p>Across physics and philosophy, <em>coherence</em> signifies a kind of unity or correlation that makes the behavior of a system (be it waves, particles, or ideas) orderly and intelligible. In physics, coherence underlies the patterns of interference and the delicate power of quantum superpositions - it marks the difference between random, uncorrelated phenomena and ones that act with a fixed relationship. In quantum systems, maintaining coherence is essential for harnessing quantum effects before decoherence sets in. In philosophy, coherence is the glue of rationality, binding beliefs and assertions into a consistent worldview. The many faces of coherence all reflect the original Latin root <em>cohaerere</em>, <em>“to stick together”</em>.</p>

<div style="text-align: center; width: 100%;"><video controls="" height="450" src="/assets/videos/mp4/novelvideo_doubleslit.mp4" type="video/mp4" width="600"></video></div>]]></content><author><name>tsp</name></author><category term="Physics" /><category term="Quantum mechanics" /><category term="Quantum optics" /><category term="Electrodynamics" /><category term="Philosophy" /><category term="Basics" /><summary type="html"><![CDATA[Coherence is one of those words that quietly carries very different meanings depending on context - logical consistency in philosophy, phase stability in optics, superposition in quantum mechanics, and fragile order in the face of decoherence. This article traces the concept across its historical roots and modern usage, starting from classical wave interference and moving through lasers, statistical optics, quantum superposition, coherent states, and decoherence theory. Along the way, it clarifies what physicists actually mean when they speak of temporal coherence, spatial coherence, and quantum coherence as a physical resource. Rather than collapsing these meanings into a single definition, the article shows how they are related by analogy rather than identity. In physics, coherence marks the presence of stable correlations that enable interference and control; in quantum systems, it is the delicate ingredient that makes superposition, entanglement, and quantum computation possible before the environment washes it away. 
By briefly contrasting this with philosophical notions of coherence as internal consistency, the article highlights why the term is so powerful - and why its meaning may differ when it is used across disciplines.]]></summary></entry><entry><title type="html">Schmitt Trigger - OpAmp based switches exhibiting hysteresis</title><link href="https://www.tspi.at/2026/01/12/schmitttrigger.html" rel="alternate" type="text/html" title="Schmitt Trigger - OpAmp based switches exhibiting hysteresis" /><published>2026-01-12T00:00:00+01:00</published><updated>2026-01-12T06:48:32+01:00</updated><id>https://www.tspi.at/2026/01/12/schmitttrigger</id><content type="html" xml:base="https://www.tspi.at/2026/01/12/schmitttrigger.html"><![CDATA[<p>A Schmitt trigger is a <em>comparator</em> with intentional <strong>positive feedback</strong>. Instead of a single switching threshold, it has two: one for rising inputs and one for falling inputs. This creates <em>hysteresis</em>. This is often needed when a signal is noisy or slowly varying. Without the hysteresis the output can chatter rapidly around a single threshold and cause digital inputs to flap.</p>

<p>In this post we derive the model behaviour of the two common operational amplifier (OpAmp) configurations (inverting and non-inverting) using the idealized amplifier model. We assume infinite input impedance of the operational amplifier, i.e. zero bias currents, as well as infinite open loop gain. The amplifier is assumed to swing to its saturation voltage - which is taken to be the rail voltage in this example. For a practical circuit you will have to check which output levels you actually need - the output will only change between the upper and lower saturation voltage. In addition you will have to verify that the input bias current is negligible.</p>

<p>In this blog article we are going to look into:</p>

<ul>
  <li>The <a href="#inverting-schmitt-trigger">inverting Schmitt trigger</a> and its characteristics</li>
  <li>The <a href="#non-inverting-schmitt-trigger">non inverting Schmitt trigger</a></li>
  <li>A short <a href="#conclusion">conclusion</a></li>
</ul>
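<p>The motivation for the hysteresis can be illustrated with a small simulation. The following is a minimal sketch (not from the derivations below): it feeds a noisy rising ramp through a plain single-threshold comparator and through a comparator with a hysteresis window, then counts the output transitions. All numeric values (noise level, thresholds) are hypothetical.</p>

```python
import random

def comparator(u_in, threshold):
    # Plain comparator: a single switching threshold
    return u_in > threshold

def schmitt(u_in, state, u_low, u_high):
    # Comparator with hysteresis: the state only flips
    # outside the window [u_low, u_high]
    if u_in > u_high:
        return True
    if u_in < u_low:
        return False
    return state  # inside the window: keep the previous state

def transitions(bits):
    # Number of output state changes
    return sum(a != b for a, b in zip(bits, bits[1:]))

random.seed(0)
# Slowly rising ramp from -0.5 V to +0.5 V with additive noise
signal = [t / 1000 - 0.5 + random.gauss(0, 0.05) for t in range(1000)]

plain = [comparator(u, 0.0) for u in signal]

state, hyst = False, []
for u in signal:
    state = schmitt(u, state, -0.2, 0.2)
    hyst.append(state)

# The plain comparator chatters around its single threshold;
# the hysteresis output switches far fewer times
```

<p>The wider the window relative to the noise amplitude, the fewer spurious transitions survive - at the cost of a larger dead band.</p>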

<h2 id="inverting-schmitt-trigger">Inverting Schmitt Trigger</h2>

<p><img src="/assets/images/png/opamp/schmitt_inverting001.png" alt="Inverting Schmitt Trigger Circuit" /></p>

<p>First let’s recall the output of an ideal operational amplifier. The open loop gain is assumed to be $G \to \infty$. This means that the operational amplifier will swing to its maximum or minimum output voltage (i.e. close to the supply rail) depending on the imbalance between the inverting input $U_{-}$ and the non inverting input $U_{+}$:</p>

[
\begin{aligned}
U_{out} &= \begin{cases}
U_{vcc} & \text{if } U_{+} \gt U_{-} \\
-U_{vcc}  & \text{if } U_{+} \lt U_{-} \\
\end{cases}
\end{aligned}
]

<p>The voltage on the inverting input is given by the input voltage $U_{-} = U_{in}$. The voltage on the non inverting input can be described via a simple voltage divider between the output voltage $U_{out}$ and the bias voltage $U_{b}$:</p>

[
\begin{aligned}
I_{g} &= \frac{U_{out} - U_b}{R_1 + R_2} \\
U_{+} &= R_2 I_g + U_b \\
 &= \frac{R_2}{R_1 + R_2} (U_{out} - U_b) + U_b \\
 &= \frac{R_2}{R_1 + R_2} U_{out} + \left( 1 - \frac{R_2}{R_1 + R_2} \right) U_b
\end{aligned}
]
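<p>The rewrite of $U_{+}$ as a weighted sum of $U_{out}$ and $U_b$ can be sanity-checked numerically. A minimal sketch, with all component values hypothetical:</p>

```python
# Hypothetical values: R1 = 10 kOhm, R2 = 2 kOhm, U_out = 12 V, U_b = 1 V
r1, r2 = 10e3, 2e3
u_out, u_bias = 12.0, 1.0

# Direct computation: current through the divider, then the node voltage
i_g = (u_out - u_bias) / (r1 + r2)
u_plus_direct = r2 * i_g + u_bias

# Superposition form derived above
k = r2 / (r1 + r2)
u_plus_formula = k * u_out + (1 - k) * u_bias
```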

<p>Taking a look at the two possible output cases one can determine the switchpoints of the circuit.</p>

<p>In the <strong>first case</strong> $U_{out} = U_{vcc}$ one gets:</p>

[
\begin{aligned}
U_{out} &= U_{vcc} \\
U_{+} &= \frac{R_2}{R_1 + R_2} U_{vcc} + U_b \left( 1 - \frac{R_2}{R_1 + R_2} \right) = U_A
\end{aligned}
]

<p>Switching happens when $U_{in} &gt; U_A$.</p>

<p>In the <strong>second case</strong> $U_{out} = -U_{vcc}$:</p>

[
\begin{aligned}
U_{out} &= -U_{vcc} \\
U_{+} &= -\frac{R_2}{R_1 + R_2} U_{vcc} + U_b \left( 1 - \frac{R_2}{R_1 + R_2} \right) = U_B
\end{aligned}
]

<p><img src="/assets/images/png/opamp/schmitt_inverting002.png" alt="Typical behaviour of an inverting Schmitt trigger" /></p>

<p>Under assumption of a symmetric supply voltage a look at the width of the hysteresis curve yields</p>

[
\begin{aligned}
\Delta U &= (U_A - U_B) \\
 &= 2 \frac{R_2}{R_1 + R_2} U_{vcc}
\end{aligned}
]

<p>The center of the hysteresis is set by the bias voltage $U_b$: for a symmetric supply it lies at $\frac{R_1}{R_1 + R_2} U_b$. As one can see, the width depends only on the resistor ratio and the supply voltage, not on the bias voltage.</p>
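<p>The derived expressions for $U_A$, $U_B$ and the width translate directly into code. A minimal sketch, with all component values hypothetical:</p>

```python
def inverting_thresholds(r1, r2, u_vcc, u_b=0.0):
    """Ideal inverting Schmitt trigger thresholds U_A and U_B."""
    k = r2 / (r1 + r2)                    # divider ratio R2 / (R1 + R2)
    u_a = k * u_vcc + (1 - k) * u_b       # threshold while the output is high
    u_b_thr = -k * u_vcc + (1 - k) * u_b  # threshold while the output is low
    return u_a, u_b_thr

# Example: R1 = 10 kOhm, R2 = 2 kOhm, symmetric +-12 V supply
u_a, u_b_thr = inverting_thresholds(10e3, 2e3, 12.0)
width = u_a - u_b_thr  # 2 * R2/(R1+R2) * U_vcc
```

<p>Calling the function with different bias voltages shifts both thresholds but leaves the width $U_A - U_B$ unchanged, matching the result above.</p>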

<h2 id="non-inverting-schmitt-trigger">Non Inverting Schmitt Trigger</h2>

<p><img src="/assets/images/png/opamp/schmitt_noninverting001.png" alt="Non Inverting Schmitt Trigger" /></p>

<p>The output of the operational amplifier is again determined by the imbalance between the inverting input $U_{-}$ and the non inverting input $U_{+}$. Again assuming infinite open loop gain $G \to \infty$, the amplifier simply swings to its maximum or minimum output voltage:</p>

[
\begin{aligned}
U_{out} &= \begin{cases}
U_{vcc} & \text{if } U_{+} \gt U_{-} \\
-U_{vcc}  & \text{if } U_{+} \lt U_{-} \\
\end{cases}
\end{aligned}
]

<p>Applying the voltage divider like in the inverting case again:</p>

[
\begin{aligned}
I_{g} &= \frac{U_{out} - U_{in}}{R_1 + R_2} \\
U_{+} &= R_2 I_{g} + U_{in} \\
 &= \frac{R_2}{R_1 + R_2} U_{out} + \left(1 - \frac{R_2}{R_1 + R_2} \right) U_{in}
\end{aligned}
]

<p>The switch points between the two states are now a bit more complex than in the inverting case due to the contribution of the input signal.</p>

<p>In the <strong>first case</strong> $U_{out} = U_{vcc}$ we get</p>

[
\begin{aligned}
U_{out} &= U_{vcc} \\
U_{+} &= \frac{R_2}{R_1 + R_2} U_{vcc} + \left(1 - \frac{R_2}{R_1 + R_2} \right) U_{in}
\end{aligned}
]

<p>The inverting input is held at the bias voltage, $U_{-} = U_b$, so the switch happens when $U_{+} \lt U_b$:</p>

[
\begin{aligned}
U_{+} &\lt U_b \\
\frac{R_2}{R_1 + R_2} U_{vcc} + \left(1 - \frac{R_2}{R_1 + R_2} \right) U_{in} &\lt U_b \\
U_{in} &\lt \underbrace{\frac{U_b - \frac{R_2}{R_1 + R_2} U_{vcc}}{\left(1 - \frac{R_2}{R_1 + R_2} \right)}}_{U_A}
\end{aligned}
]

<p>In the <strong>second case</strong> $U_{out} = -U_{vcc}$ we get</p>

[
\begin{aligned}
U_{out} &= -U_{vcc} \\
U_{+} &= -\frac{R_2}{R_1 + R_2} U_{vcc} + \left(1 - \frac{R_2}{R_1 + R_2} \right) U_{in}
\end{aligned}
]

<p>The switch happens when $U_{+} \gt U_b$:</p>

[
\begin{aligned}
-\frac{R_2}{R_1 + R_2} U_{vcc} + \left(1 - \frac{R_2}{R_1 + R_2} \right) U_{in} &\gt U_b \\
U_{in} &\gt \underbrace{\frac{U_b + \frac{R_2}{R_1 + R_2} U_{vcc}}{\left(1 - \frac{R_2}{R_1 + R_2} \right)}}_{U_B}
\end{aligned}
]

<p>Again we get the characteristic hysteresis shape we have already seen for the inverting Schmitt trigger.</p>

<p><img src="/assets/images/png/opamp/schmitt_noninverting002.png" alt="Typical behaviour of the non inverting Schmitt trigger" /></p>

<p>Under assumption of a symmetric supply voltage a look at the width of the hysteresis curve yields</p>

[
\begin{aligned}
\Delta U &= (U_B - U_A) \\
 &= 2 \frac{R_2}{R_1} U_{vcc}
\end{aligned}
]

<p>The center of the hysteresis is set by the bias voltage $U_b$: for a symmetric supply it lies at $\frac{R_1 + R_2}{R_1} U_b$. As one can see, the width again depends only on the resistor ratio and the supply voltage, not on the bias voltage.</p>
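<p>As before, the thresholds follow directly from the derived expressions. A minimal sketch, with all component values hypothetical:</p>

```python
def noninverting_thresholds(r1, r2, u_vcc, u_b=0.0):
    """Ideal non inverting Schmitt trigger thresholds U_A and U_B."""
    k = r2 / (r1 + r2)                     # divider ratio R2 / (R1 + R2)
    u_a = (u_b - k * u_vcc) / (1 - k)      # switch down when U_in < U_A
    u_b_thr = (u_b + k * u_vcc) / (1 - k)  # switch up when U_in > U_B
    return u_a, u_b_thr

# Example: R1 = 10 kOhm, R2 = 2 kOhm, symmetric +-12 V supply
u_a, u_b_thr = noninverting_thresholds(10e3, 2e3, 12.0)
width = u_b_thr - u_a  # 2 * R2/R1 * U_vcc
```

<p>Note that for the same resistor values the window is wider than in the inverting case, since the ratio $R_2/R_1$ replaces $R_2/(R_1 + R_2)$.</p>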

<h2 id="conclusion">Conclusion</h2>

<p>Both Schmitt trigger configurations implement the same idea: positive feedback creates two distinct switching points. In the <strong>inverting configuration</strong> the input goes straight into an OpAmp input, which in the ideal case is assumed to have infinite input impedance. Practically, the input current is the bias current. There is no intentional resistive path from the output to the input source and thus no direct back-action of the circuit onto the input.</p>

<p>In the <strong>non inverting configuration</strong> the input is connected through a resistive divider into a summing node that is also connected to the output. This means the output can source or sink current through the resistors causing a <em>back action</em> of the circuit onto the input. The input impedance seen by the source is finite and depends on the present output state.</p>

<p>For practical applications keep in mind that dedicated components implementing Schmitt triggers exist. These are usually faster than OpAmp based circuits and contain polysilicon resistors already on the chip die, which makes the circuit simpler and more compact.</p>]]></content><author><name>tsp</name></author><category term="Tutorial" /><category term="Electronics" /><category term="Basics" /><category term="OpAmp" /><summary type="html"><![CDATA[A Schmitt trigger is a comparator with positive feedback, yielding a well-defined hysteresis window. This makes it an essential building block whenever slow, noisy, or ambiguous signals must be converted into clean digital transitions. In this article, both the inverting and non-inverting Schmitt trigger configurations are derived step by step using an ideal operational amplifier model, making the underlying mechanism and assumptions transparent. Starting from first principles, the switching thresholds and hysteresis widths are calculated explicitly for both circuit topologies. The analysis shows how the feedback resistor ratios and supply voltage determine the threshold spacing and how the bias voltage sets the center of the hysteresis. The article concludes with a comparison of both configurations, highlighting their different input impedances and back-action characteristics, and places op-amp-based Schmitt triggers in context with dedicated comparator solutions used in practical designs.]]></summary></entry></feed>