A spider (also called a web crawler or bot) is an automated program that browses the internet to index web pages. These programs are often used by search engines like Google, Bing, or Yahoo to discover and update content in their search index.
Starting Point: The spider begins with a list of URLs to crawl.
Analysis: It fetches the HTML code of a webpage and analyzes its content, links, and metadata.
Following Links: It follows the links found on the page to discover new pages.
Storage: The collected data is sent to the search engine’s database for indexing.
Repetition: The process is repeated regularly to keep the index up to date.
Search engine optimization (SEO)
Price comparison websites
Web archiving (e.g., Wayback Machine)
Automated content analysis for AI models
Some websites use a robots.txt file to specify which areas can or cannot be crawled by a spider.
A crawler (also known as a web crawler, spider, or bot) is an automated program that browses the internet and analyzes web pages. It follows links from page to page and collects information.
Search Engines (e.g., Google's Googlebot) – Index web pages so they appear in search engine results.
Price Comparison Websites – Scan online stores for the latest prices and products.
SEO Tools – Analyze websites for technical errors or optimization potential.
Data Analysis & Monitoring – Track website content for market research or competitor analysis.
Archiving – Save web pages for future reference (e.g., Internet Archive).
Starts with a list of URLs.
Fetches web pages and stores content (text, metadata, links).
Follows links on the page and repeats the process.
Saves or processes the collected data depending on its purpose.
Many websites use a robots.txt file to control which content crawlers can visit or ignore.
An Internationalized Resource Identifier (IRI) is an extended version of a Uniform Resource Identifier (URI) that supports Unicode characters beyond the ASCII character set. This allows non-Latin scripts (e.g., Chinese, Arabic, Cyrillic) and special characters to be used in web addresses and other identifiers.
A-Z
, 0-9
, -
, .
, _
), IRIs allow characters from the entire Unicode character set.https://de.wikipedia.org/wiki/Überblick
https://de.wikipedia.org/wiki/%C3%9Cberblick
Ü
is encoded as %C3%9C
)IRIs are defined in RFC 3987 and are supported in modern web technologies like HTML5, XML, and RDF.
IRIs make the internet more linguistically inclusive by allowing websites and resources to be referenced using non-Latin characters, improving accessibility worldwide.
A SUT (System Under Test) is the system or component being tested in a testing process. The term is commonly used in software development and quality assurance.
A typical testing process includes:
The Whoops PHP library is a powerful and user-friendly error handling tool for PHP applications. It provides clear and well-structured error pages, making it easier to debug and fix issues.
✅ Beautiful, interactive error pages
✅ Detailed stack traces with code previews
✅ Easy integration into existing PHP projects
✅ Support for various frameworks (Laravel, Symfony, Slim, etc.)
✅ Customizable with custom handlers and loggers
You can install Whoops using Composer:
composer require filp/whoops
Here's a simple example of how to enable Whoops in your PHP project:
require 'vendor/autoload.php';
use Whoops\Run;
use Whoops\Handler\PrettyPageHandler;
$whoops = new Run();
$whoops->pushHandler(new PrettyPageHandler());
$whoops->register();
// Trigger an error (e.g., calling an undefined variable)
echo $undefinedVariable;
If an error occurs, Whoops will display a clear and visually appealing debug page.
You can extend Whoops by adding custom error handling, for example:
use Whoops\Handler\CallbackHandler;
$whoops->pushHandler(new CallbackHandler(function ($exception, $inspector, $run) {
error_log($exception->getMessage());
}));
This version logs errors to a file instead of displaying them.
Whoops is mainly used in development environments to quickly detect and fix errors. However, in production environments, it should be disabled or replaced with a custom error page.
SSH (Secure Shell) is a network protocol that establishes an encrypted connection between two computers. It is mainly used for securely logging into remote systems (e.g., servers) and executing commands.
ssh username@server-ip
scp file.txt username@server-ip:/destination-folder/
ssh-copy-id username@server-ip
SSH is commonly used by developers and system administrators to manage servers or establish secure connections for other applications.
Swift is a powerful and user-friendly programming language developed by Apple for building apps on iOS, macOS, watchOS, and tvOS. It was introduced in 2014 as a modern alternative to Objective-C, designed for speed, safety, and simplicity.
Swift is primarily used for Apple platforms but can also be utilized for server-side applications and even Android or Windows apps in some cases.
TortoiseGit is a graphical user interface (GUI) for Git, specifically designed for Windows. It is an extension for Windows Explorer, allowing users to manage Git repositories directly via the context menu.
✅ Windows Explorer Integration → No separate tool needed; everything is accessible via the right-click menu
✅ User-Friendly → Ideal for those unfamiliar with the Git command line
✅ Visual Support → Changes, diffs, logs, and branches are displayed graphically
✅ Push, Pull, Commit & Merge → Perform standard Git operations via the interface
✅ Support for Multiple Repositories → Manage multiple projects simultaneously
TortoiseGit requires a Git installation (e.g., Git for Windows) to function.
➡ Download & More Info: https://tortoisegit.org/
The Fetch API is a modern JavaScript interface for retrieving resources over the network, such as making HTTP requests to an API or loading data from a server. It largely replaces the older XMLHttpRequest
method and provides a simpler, more flexible, and more powerful way to handle network requests.
fetch('https://jsonplaceholder.typicode.com/posts/1')
.then(response => response.json()) // Convert response to JSON
.then(data => console.log(data)) // Log the data
.catch(error => console.error('Error:', error)); // Handle errors
Making a POST Request
fetch('https://jsonplaceholder.typicode.com/posts', {
method: 'POST',
headers: {
'Content-Type': 'application/json'
},
body: JSON.stringify({ title: 'New Post', body: 'Post content', userId: 1 })
})
.then(response => response.json())
.then(data => console.log(data))
.catch(error => console.error('Error:', error));
✅ Simpler syntax compared to XMLHttpRequest
✅ Supports async/await
for better readability
✅ Flexible request and response handling
✅ Better error management using Promises
The Fetch API is now supported in all modern browsers and is an essential technique for web development.
A Single Page Application (SPA) is a web application that runs entirely within a single HTML page. Instead of reloading the entire page for each interaction, it dynamically updates the content using JavaScript, providing a smooth, app-like user experience.
✅ Faster interactions after the initial load
✅ Improved user experience (no full page reloads)
✅ Offline functionality possible via Service Workers
❌ Initial load time can be slow (large JavaScript bundle)
❌ SEO challenges (since content is often loaded dynamically)
❌ More complex implementation, especially for security and routing
Popular frameworks for SPAs include React, Angular, and Vue.js.